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REMARKS 

Claims 3, 11. 12, 13, 14, 17, 18, 19, 20, 22, 30, 31, 32, 34, 35, 36, 44, 47, 
48, and 49 have been cancelled. Claims 37-39 and 40-43 have been amended. 
New claims 50-57 have been added. Claims 1,2, 4-10, 15-16, 21, 23-29, 37-43, 
and 50-57 are now pending in the application. No new matter has been added 
by amendment. Reexamination and reconsideration of the claims as amended 
are respectfully requested. 



Claim Rejections - 35 USC § 112, second paragraph 

The Examiner rejects claims 3, 11-13, 18-20, 22, 30-32, and 47-49 under 
35 U.S.C. 112 second paragraph, as being indefinite for failing to particularly 
point out and distinctly claim the subject matter which applicant regards as the 
invention. The Applicant traverses the rejection. 

The Examiner rejects claims 3 and 22. Claims 3 and 22 have been 
cancelled. 

The Examiner rejects claims 11 and 30, and dependent claims 12-13 and 
31-32. Claims 11, 12, 13, 30, 31, and 32 have been cancelled. 

The Examiner rejects claims 18 and 47, and dependent claims 19-20 and 
48-49. Claims 18, 19, 20, 47, 48, and 49 have been cancelled. 

The Examiner states that the dependent claims cited in this rejection fail 
to further limit the claims from which they depend. The Examiner suggests that 
the claims be placed in a product -by-process format. New claims 50-57 reflect 
that suggestion. The Examiner also suggests that the claims should be drafted 
in terms of methods of making a plant by comprising transforming the 
exemplified plant of claim 2 or 21 . New claims 54-57 reflect that suggestion. 



Claim Rejections - 35 USC § 112, first paragraph 

The Examiner rejects claims 3, 9-20, 22, 28-32, 34-44 and 47-49 under 
35 U.S.C. 112, first paragraph, as containing subject matter which was not 
described in the specification In such a way as to reasonably convey to one 
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skilled in the art that the inventors, at the time the application was filed, had 
possession of the claimed invention. The Applicant traverses the rejection. 
Claims 3, 11, 12, 13, 14, 17, 18, 19, 20, 22, 30, 31, 32, 34, 35, 36, 44, 47, 48, 
and 49 have been cancelled. Claims 37. 38, 39, 40, 41, 42, and 43 have been 
amended. New claims 50-57 have been added. 

The Examiner rejects claims 11, 15-16, 30, 34-35 and their dependent 
claims and states that the claims "are broadly drawn to any transgenic plant 
which contains any heterologous transgene of any sequence conferring any trait, 
and methods of making and using the transgenic plants." The Examiner rejects 
claims 3, 18-20, 22, and 47-49 and their dependent claims and states that the 
claims "are broadly drawn to any 'single gene conversion' plant comprising one 
or more traits including male sterility introgressed into the claimed variety by 
backcrossing or other traditional means, and methods of using these plants." 
The Examiner states that "no guidance has been provided for the introgression 
of any single trait from a multitude of non-disclosed and uncharacterized 
parentals into the claimed variety, wherein said introgression should result in 
successful expression of the desired trait but should not interfere with the 
expression of the remaining traits whose combination confers patentability to the 
instantly exemplified variety, and which introgression should not introduce 
unwanted linked genetic material into the exemplified cultivar which would disrupt 
its patentably unique genetic complement." 

Claims 3, 11, 12, 13, 18, 19, 20, 22, 30, 31, 32, 47, 48, and 49 have been 
cancelled to place the claims in the method and product by method format 
requested by the Examiner. New claims 50, 52, 54 and 56 are drawn to 
methods, while their depending claims are drawn to the products produced by 
those methods. The claims include the well known methods of producing 
backcross and transgenic conversion plants. The product by process claims are 
further limited by specified conversion or transgenic traits, which include the 
traits of insect resistance, herbicide resistance, disease resistance, waxy starch, 
and male sterility. 
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While the Examiner states that the claims encompass "any transgenic 
plant which contains any heterologous transgene of any sequence conferring 
any trait", the Applicant points out that it is not claiming so broadly as to claim 
any maize plant, regardless of source, comprising those traits. The Applicant is 
claiming PH5WB or a limited set of plants derived therefrom that have obtained 
significant genetic contribution from PH5WB. 

Applicant respectfully points out that examples of transgenes, genes, and 
traits that can be backcrossed into the PH5WB are given in the application on 
page 21 , lines 16-34, and also on page 23, line 20, through page 33, line 4. In 
order to expedite prosecution new claims 51 and 55 list the type of traits that 
may be conferred by backcross conversions and transgenes. Claim 51 also 
specifies that PH5WB is used at least twice as a recurrent parent in the 
development of a backcross conversion plant. Breeders, by using molecular 
markers, may obtain up to 98% genome identity between the backcross 
conversion and the recurrent parent after two backcrosses. See Marker-assisted 
Selection in Backcross Breeding, Openshaw, S.J. et al. Marker-assisted 
selection in backcross breeding. In: Proceedings Symposium of the Analysis of 
Molecular Data, August 1994, pp. 41-43. Crop Science Society of America, 
Corvallis, OR (1994) included as Appendix A . Inbred PH5WB transformed to 
comprise a transgene is also easily identifiable through the use of molecular 
markers. The transgenic version of PH5WB would have the same molecular 
profile as PH5WB, with the possible exception of a marker used in the profile 
that is located at the site of transgene insertion. However, in this case, the 
plethora of other identical markers would identify the line as a transgenic variant 
of PH5WB. 

Applicant points out to the Examiner that, at the present time, it is not 
typical that a transgene be incorporated into each newly developed line, such as 
PH5WB, by direct transformation. Rather, the more typical method used by 
breeders of ordinary skill in the art is to incorporate the transgene into a new line 
by taking an already transformed plant line and using it as a donor line to 
produce a backcross conversion. Thus, the well established method of 

13 

Received (torn < 515 3^4 6833 > at 3/1 1/03 4:07:36 PM [Eastern Standard Time] 



03/11/03 TUE 16:11 FAX 515 331^83 PIONEER HI-BRED DSM ®019 



• 



SN: 09/758,790 

backcrossing has been used and is the most common means of introgressing 
the claimed traits into new material. 

In the specification on page 4, lines 7-13, it states, "Backcrossing can be 
used to transfer a specific desirable trait from one inbred or source to an inbred 
that lacks that trait. This can be accomplished, for example, by first crossing a 
superior inbred (recurrent parent) to a donor inbred (non-recurrent parent), that 
carries the appropriate gene(s) for the trait in question. The progeny of this 
cross is then mated back to the superior recurrent parent followed by selection in 
the resultant progeny for the desired trait to be transferred from the non- 
recurrent parent." The method of backcrossing genes into an inbred maize plant 
is well known and well understood to one of ordinary skill in the art. The method 
has been successfully used since the 1950's (see pages 585-586 of Wych, 1988 
included in the Information Disclosure Statement). In the specification, on page 
21, lines 16-34, there is a description of how to backcross traits into PH5WB, 
which includes the claimed traits. Examples of how one of ordinary skill In the art 
can transfer a gene conferring a qualitative trait into a variety through • 
backcrossing is demonstrated by the fact that the commercial market now 
distributes a multitude of products produced in this manner. Such conversion 
lines are easily developed without undue experimentation. Poehlman et al. 
(1995) on page 334, submitted in the information disclosure statement, states 
that, "A backcross-derived inbred line fits into the same hybrid combination as 
the recurrent parent inbred line and contributes the effect of the additional gene 
added through the backcross." Wych (1988) on page 585-86, also submitted in 
the information disclosure statement, discusses how the male sterility trait is 
routinely backcrossed into an inbred line and how this is used to produce a 
sterile/fertile blend of an F1 hybrid in order to reduce seed production costs. In 
fact, many commercial products are produced in this manner, and those of 
ordinary skill in the art consider the F1 hybrid produced with the male sterile 
(backcross conversion) inbred to be the same variety as the F1 hybrid produced 
with the non-backcross conversion inbred. 
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As a result of the repeated use of the recurrent parent, the backcross 
conversion has many genetic alleles in common with the recurrent parent. Thus, 
genetic analysis may be used as a means of identifying the backcross 
conversion. The declaration attached as A ppendix B explains how genetic 
analysis was used to identify backcross conversion inbreds of PH5WB. The F1 
hybrid made with a transgenic version or a backcross conversion of PH5WB is 
also identifiable by the use of genetic markers, because the hybrid would contain 
one set of alleles from each parent The Examiner also states that the Applicant 
has not characterized parentals. Applicant respectfully disagrees. In a 
backcross conversion the deposited material is repeatedly used as the parental 
line. 

The Examiner rejects claims 9-10, 12-17, 28-32, and 34-44 and states 
that the claims "are also broadly drawn to any plant produced by crossing the 
exemplified inbred line with any of a multitude of non-exemplified plants, or any 
descendents of the exemplified cultivar obtained by using that cultivar as one 
parent in a series of undisclosed crosses for an undisclosed number of 
generations and with undisclosed breeding partners." 

The Examiner rejects claims 9, 10, 28, and 29, that claim the F1 hybrid 
seed and F1 hybrid plant made with PH5WB as a parent Applicant notes that a 
claim to the F1 hybrid made with a deposited inbred was expressly 
acknowledged without reservation by the United States Supreme Court In J.E.M. 
Ag. Supply, Inc. v. Pioneer Hi-Bred Intl, Inc., 60 USPQ 2d 1865,1873 (S.Ct. 
2001), when the Supreme Court wrote, "...a utility patent on an inbred plant line 
protects the line as well as all hybrids produced by crossing that inbred with 
another plant line." Further, one of ordinary skill in the art would know how to 
cross PH5WB with another maize plant. The F1 hybrid seed and plant produced 
using PH5WB, regardless of the other maize plant used, Is Identifiable because it 
will have one set of alleles coming from PH5WB. One of ordinary skill in the art 
would be able to run a molecular profile on PH5WB and the F1 hybrid and be 
able to identify the F1 hybrid as being produced from PH5WB. Seed pericarp 
tissue, which is solely maternal In origin, can be used to discern the maternal or 
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paternal origin of the allele sets if necessary. See page 16 of Poethig, R.S. 
1982. Maize, the plant and its parts. In: W.F. Sheridan (Ed.) Maize for 
Biological Research, University of North Dakota Press, Grand Forks, ND. pp. 9- 
18, submitted as Appendix C. 

As stated in the specification on page 16, lines 8-23, there are many 
laboratory-based techniques available for the analysis comparison and 
characterization of plant genotype such as Restriction Length Polymorphisms 
(RFLPs) and Simple Sequence Repeats (SSRs). Such techniques may be used 
to identify whether or not PH5WB was used to develop a hybrid. The Applicant 
also submits to the Examiner the journal article by Berry et al. (2002). This 
article discusses the probability of identifying the parents of the hybrid by SSR 
data when neither parent is known and without the use of pericarp analysis. A 
copy of the article by Berry et al. is attached to this Amendment and Request for 
Reconsideration as A ppendix D. The results of the experiment showed that 
using 100 SSR toci markers resulted in correct parental ranking of inbreds for 53 
out of 54 hybrids. Applicant also points out that any breeder of ordinary skill in 
the art will know the identity of both parents used to produce a hybrid. 

The Examiner rejects claims 12-17, 30-32, and 34-44. Claims 12-14, 17, 
30-32, 34-36, and 44 have been cancelled. Claims 15, 16, 28, and 29 remain 
pending and are to methods of developing a maize plant through the utilization of 
PH5WB. Applicant points out that anyone of skill in the art would know how to 
utilize the well established breeding methods with PH5WB. Description of such 
occurs throughout the specification and descriptions can also be found in 
introductory plant breeding books. Claims 40-43 remain pending and have been 
amended. Claim 40 is to the method of producing a first generation PH5WB- 
derived hybrid maize plant. Applicant believes the patent office has previously 
indicated this claim scope as allowable and requests that this rejection be 
withdrawn. Claim 41 is to the first generation PH5WB-derived hybrid maize plant 
produced by the method of claim 40. The first generation hybrid, or F1 hybrid, is 
identifiable through both breeding records and molecular marker techniques as 
discussed above. Claim 42 is to the method of selfing the first generation hybrid 
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PH5WB for successive filial generations. This is a basic and well known 
breeding methodology, and the use of this methodology with PH5WB is 
described in the specification on page 21, lines 1 to 15. 

Claim 43 is to plants derived from claim 42 that have at least 50% of their 
genetics derived from PH5WB. These claimed plants are clearly described by 
their method of production, which requires the use of PH5WB. Such plants must 
be produced through the use of PH5WB and the Examiner acknowledges that 
PH5WB is clearly identified. Further, Applicant has added the limitation of at 
least 50% inheritance from the PH5WB side of its pedigree to further emphasize 
the significant influence of PH5WB In the claimed product. Genetic inheritance 
has been accepted by both courts and governmental agencies as an accurate 
and reliable means of identification. In paternity cases courts routinely compel 
genetic testing of putative fathers to establish paternity, and federal law 
mandates that states have laws requiring that genetic test results be admissible 
in such cases without the necessity for foundation testimony or other proof. 42 
U.S.C. 666(a)(5)(F)(iii)(Supp. V 1999). In such cases, a child will, on average, 
inherit 50% genetic contribution from each parent. Similarly, the plants produced 
by the method of claim 42 will also, on average, inherit 50% genetic contribution 
from each parent. 

Applicant requests that the Examiner examine the sufficiency of 
description of claim 43 with all of its claim limitations, including the limitation that 
the progeny be produced by the method of claim 42, with the use of PH5WB and 
retaining at least 50% genetic contribution from PH5WB. One of ordinary skill in 
the art would know how to cross PH5WB to develop an F1 hybrid and also how 
to self plants derived from the cross with PH5WB. In Br parte Parks, 30 USPQ 
2d 1234 (B.P.A.I. 1994), the Board of Appeals stated, "Adequate description 
under the first paragraph of 35 U.S.C. 112 does not require literal support for the 
claimed invention. Rather, it is sufficient if the originally-filed disclosure would 
have conveyed to one having ordinary skill in the art that an appellant had 
possession of the concept of what Is claimed." Emphasis added. In J.E.M. Ag. 
Supply, the Supreme Court also acknowledged the value of a newly developed 
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line in further breeding, when it stated that, "...a breeder can use a plant that is 
protected by PVP certificate to 'develop' a new inbred line while he cannot use a 
plant patented under §101 for such a purpose." Id. at 1873. 

The Examiner cites the Federal Circuit as stating that the written 
description of an invention "requires a precise definition, such as by structure, 
formula, [or] chemical name T of the claimed subject matter sufficient to 
distinguish it from other materials." University of California v. Eli Lilly and Co., 
119 R 3d 1559, 1568; 43USPQ2d 1398, 1406 (Fed. Cir. 1997). The Applicant 
has fulfilled this written description requirement through the seed deposit of 
PH5WB. As described in the specification, lines 8-23 on page 16, the seed 
deposit allows one of ordinary skill to run a molecular profile of PH5WB. 
Applicant submits the molecular profile of inbred line PH5WB in the declaration 
of Dinakar Bhattramakki attached hereto as Appendix E . Further Applicant 
amends the specification to include such SSR profile. Such SSR profile is not 
new matter, as it is an inherent feature of inbred line PH5WB, a representative 
sample of which has been deposited with the ATCC. For example, see Ex parte 
Marsili. Rosetti. and Pasqualucci . 214 USPQ 904 (1972), in which the Patent and 
Trademark Office Board of Appeals held that it was not new matter to amend the 
structure of a compound when a more refined analytic investigation showed a 
corrected formula. The Board > relying on well established cases of In re Nathan 
et al, 51 CCPA 1059, 328 F.2d 1005 t 140 USPQ 601 (1964); In re Sulkowski . 
487 R2d 920 f 180 USPQ 46 (CCPA 1973); Spero v. Rinqold . 54 CCPA 1407, 
377 F.2d. 852, 153 USPQ 726 (1967), and Petisi et al. v. Rennhard et al. . 53 
CCPA 1452, 363 F. 2d 903, 150 USPQ 669 (1966) concluded that the "products 
described, exemplified and claimed by Appellants inherently had and have now 
the structure given in the amendment in question. Consequently, the changes 
made in this amendment do not constitute new matter. Marsili at 906. Similarly, 
in the present case, inbred line PH5WB inherently had and still has the SSR 
marker profile being added. One of ordinary skill in the art can use molecular 
markers to identify PH5WB, a transgenic version of PH5WB, a backcross 
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conversion of PH5WB and the F1 plant of the transgenic version and backcross 
conversion of PH5WB. 

The Applicant would also like to point out that the specification also 
identifies PH5WB with phenotypic descriptions. Various examples of breeding 
methods, transgenes, transformation procedures, and F1 hybrid production are 
given in the specification. The mean values of traits for numerous F1 plants, 
wherein PH5WB is the parent and other numerous maize plants are the second 
parent, are given in Tables 3A-3C, pages 45-47. Tables 4A and 4B, on page 48- 
51, contain mean values of traits to a specific F1 wherein PH5WB is a parent. 

The Examiner also stated, in reference to Lilly, that, "the court also 
concluded that 'naming a type of material generally known to exist, in the 
absence of knowledge as to what that material consists of, is not a description of 
that material. Id," This is not the case here. Applicant has created a novel line 
and seeks a scope of protection that adequately protects the invention- 
Applicant believes that the derivatives, variants and closely related progeny 
easily and routinely created by use of this newly developed line' are 
encompassed within the scope of the invention of the variety itself. These 
derivatives, variants and closely related progeny derive direct and substantial 
benefit from Applicant's work and deserve to be included within the scope of the 
claims. Thus, the issue here is patent scope around what has already been 
created (and deposited) by Applicant. The fact that the progeny have not been 
created does not prevent them from being protected in this manner. As stated in 
MPEP 2163 (3) (a), "An invention may be complete and ready for patenting 
before it has actually been reduced to practice/ 

In Enzo vs. Gen-Probe . U.S. State Court of Appeals for the Federal 
Circuit, 63 USPQ 2d 1609, the court reversed its prior decision regarding the 
insufficiency of the deposited genetic probes to meet the written description 
requirement. In so holding, the court stated, * As the deposited sequences are 
about 850, 8500, and 1300 nucleotides long, there are at least hundreds of 
subsequences of the deposited sequences, an unknown number of which might 
a | S0 mee t the claimed hybridization ratio. Moreover, Enzo's expert, Dr. Wetmur, 
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stated that 'astronomical' numbers of mutated variations of the deposited 
sequence also fall within the scope of those claims, and that such broad claim 
scope is necessary to adequately protect Enzo's invention from copyists who 
could otherwise make minor change to the sequence and thereby avoid 
infringement while still exploiting the benefits of Enzo's invention. The 
defendants assert that such breadth is fatal to the adequacy of the written 
description. On the other hand, because the deposited sequences are described 
by virtue of a reference to their having been deposited, it may well be that 
various subsequences, mutations, and mixtures of those sequences are also 
described to one of skill in the art. We regard that question as an issue of 
fact...." 

The issue of whether the progeny as now claimed satisfies the written 
description requirement is also an issue of fact. PH5WB is a unique inbred, as 
evidenced by the morphological and physiological traits given in Table 1, pages 
18-20, of the application. Routinely used molecular techniques, discussed on 
page 16, lines 8-23 of the application, can be used to verify whether PH5WB is 
within the pedigree of a claimed plant. One of ordinary skill in the art would also 
know from breeding records if PH5WB were utilized in the development of a 
claimed plant 

As stated in the written description guidelines "an applicant shows 
possession of the claimed invention by describing the claimed invention with all 
its limitations using such descriptive means as words, structures, figures, 
diagrams, and formulas that fully set forth the claimed invention. Possession 
may be shown in a variety of ways, including... by describing distinguishing 
identifying characteristics sufficient to show that the applicant was in possession 
of the claimed invention." 1255 Official Gazette 140 (Feb. 5, 2002). Phenotypic 
traits are used in the text of the specification. Genetic and other molecular 
profiles may be obtained from the deposit Once a line is identified as being 
PH5WB, one of ordinary skill in the art would also easily be able to determine 
which progeny they develop from that line fall within the scope of the claims. 
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Within the plant breeding arts breeders use pedigree as a means to 
characterize lines in reference to their progenitors. It is unambiguous and easily 
traceable through breeding records that are maintained by any breeder of 
ordinary skill in the art. It indicates that a line fewer crosses away from a starting 
line will be, as a whole, more highly related to the starting line. Thus, the work of 
the original breeder in developing the starting line will be retained in the closely 
related progeny. More specifically, traits and linkage groups present in PH5WB 
will be retained in progeny that are within one outcross from PH5WB. Applicant 
submits that characterization of the progeny of PH5WB by virtue of their filial 
relationship is a clear and acceptable means of identification. Not only are filial 
descriptions used by breeders to evaluate materials for use in their breeding 
programs, but it is standard practice within the plant breeding industry for 
universities and companies that license inbred maize lines to retain a royalty 
from lines developed through the use of their inbreds. Those royalties are, in 
almost all cases, based on the filial relationship between the licensed inbred 
used in breeding and the progeny line commercialized. This is further evidence 
that those of ordinary skill in the art of plant breeding describe progeny in terms 
of pedigree and find it an acceptable means of characterization. 

As noted in the specification, the development of an inbred line is a time 
consuming and labor intensive activity. On average, between 10,000 to 20,000 
lines are created and screened in order to develop any maize inbred line for 
which the Applicant files a patent application. Once developed, the inbred line is 
useful for two purposes: (1) to make commercial hybrids, and (2) as a source of 
breeding material for the development of new inbreds that retain the original 
inbred's desired characteristics. A breeder desiring to make a line with similar 
traits to PH5WB would be greatly advantaged by being able to use PH5WB as 
starting material. This is because the linked genes arranged through Applicant's 
breeding efforts, and fixed in PH5WB, can be maintained in the progeny of 
PH5WB by a breeder of ordinary skill in the art. The end result Is the 
development of an inbred line with substantial benefit from the Applicant's work. 
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PH5WB-derived progeny are described by the fact that PH5WB is utilized 
in a breeding program to make the PH5WB-derived progeny, PH5WB gives 
genetic contribution to the PH5WB-derived progeny, and the genetics of PH5WB 
are described by ATCC deposit of PH5WB seed. By limiting the progeny to one 
outcross away from PH5WB and by limiting the progeny to those that contain at 
least 50% of their genetics from PH5WB, the Examiner's concern that the 
progeny may be only distantly related to PH5WB is addressed. 

Applicant would also like to emphasize that PH5WB cannot be derived 
through any other means than through PH5WB seed and plant, nor can the 
influence of PH5WB on the progeny be removed from a line within one outcross 
of PH5WB. To view this claim as one of breadth ignores an essentiaUimitation 
of the claim; that only a plant developed through the use of PH5WB is within the 
scope of the claim. Such a plant could not be obtained without the use of 
PH5WB, so the claim would not in any way restrict the work of a breeder that did 
not in fact use PH5WB. Compliance with the written description requirement is 
essentially a fact based inquiry that will "necessarily vary depending on the 
nature of the invention claimed." Vas-Cath v. Mahurkar, 935 F. 2d 1555 (citing In 
re DiLeone, 436 F2d. 1404, 1405). Thus, the compliance with the written 
description requirement must be judged in view of this limited scope of the 
progeny claims. As amended, the claims are drawn to only a limited scope of 
progeny, progeny whose existence is the direct result of the use of PH5WB. 
This is in harmony with the statement In section 2163 of the MPEP that the 
written description requirement promotes the progress of the useful arts by 
ensuring inventions are adequately described in the specification in exchange for 
the right to exclude." That quid pro quo of patent law has been met by the 
Applicant in the present case, and to use written description to deny adequate 
patent protection would be contrary to the stated purpose of the written 
description requirement. 

The Examiner also rejects claims 37-39 under 35 USC § 112, first 
paragraph. Claims 37-39 have been amended for clarification purposes. Claims 
37-39 are directed to growing out an F1 hybrid in which PH5WB is a parent and 
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searching for PH5WB inbred seed. Due to the imperfect process of seed 
production parent seed can sometimes be contained in the hybrid seed bag. 
This claim covers the method of searching for inbred PH5WB seed within a bag 
of hybrid seed. The method is clearly described in the specification on page 5, 
line 21 through line 7 on page 6. One of ordinary skill in the art can practice such 
a method without undue experimentation. The Applicant requests that the 
Examiner withdraw his rejection to claims 37-39. 



The Examiner rejects claims 3, 9-20, 22, 28-32, 34-44, and 47-49 under 
35 U.S.C. 112, first paragraph, as containing subject matter which was not 
described in the specification in such a way as to enable one skilled in the art to 
which it pertains, or with which it is most nearly connected, to make and/or use 
the invention. Applicant traverses the rejection. 

With the exception of the arguments pertaining to Hunsperger et. at., Kraft 
et. al. and Eshed et al., the Examiner provides nearly identical arguments for the 
112, first paragraph, lack of enablement rejection as those provided for the 112, 
first paragraph, lack of written description rejection addressed above. 

To avoid repetition, Applicant respectfully requests that the Examiner 
consider the arguments made in response to the 112, first paragraph, lack of 
written description rejection as also applicable to the 112, first paragraph, lack of 
enablement rejection. In addition, the Applicant directly addresses the 
arguments raised by the Examiner that relate to Hunsperger, Kraft and Eshed. 

The Examiner has cited Hunsperger, Kraft and Eshed and stated that 
they "teach that it is unpredictable whether the gene or genes responsible for 
conferring a phenotype in one plant genotypic background may be introgressed 
into the genetic background of a different plant, to confer a desired phenotype in 
said different plant." The Examiner states that, "Hunsperger et al teach that the 
introgession of a gene in one genetic background in any plant of the same 
species, as performed by sexual hybridization, is unpredictable in producing a 
single gene conversion plant with a desired trait (see, e.g., column 3, lines 26- 
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46)" Applicant's respectfully disagree that this is what is taught by Hunsperger 
et al. Hunsperger et al. teaches that a gene that results in dwarfism of a petunia 
plant can be incorporated into other genetic backgrounds of the petunia species 
(See column 2, line 67 to column 3, lines 1-4). Hunsperger et al. merely 
discusses the level of the expression of that gene differed in petunia plants of 
different genetic backgrounds. Hunsperger et al. succeeded in incorporating the 
gene into petunia plants of different genetic backgrounds, Therefore, Hunsperger 
et al. support the fact that one can introgress a specific trait into a recurrent 
parent through backcross conversion. Applicant's specification provides ample 
disclosure of starting materials such as maize inbred PH5WB, a discussion of 
traditional breeding methods, and examples of transgenes and naturally 
occurring genes that may be used in such methods. Hallauer et al. (1988) on 
page 472, submitted in the information disclosure statement, state that, "For 
single gene traits that are relatively easy to classify, the backcross method is 
effective and relatively easy to manage." The teaching of Hallauer relates 
specifically to com breeding and corn inbred line development, while Hunsperger 
et al relates to petunia. 

The Examiner goes on to state that, "Kraft et al. teach that linkage 
disequilibrium effects and linkage drag prevent the making of plants comprising a 
single gene conversion, and that such effects are unpredictably genotype- 
specific and loci-dependent in nature (see, e.g M page 323)." Applicant disagrees 
that the article states such points. Kraft et al. make no mention of a plant 
comprising a single gene conversion. Further, Kraft et al. relates to linkage 
disequilibrium and fingerprinting in sugar beet, a crop other than maize. Kraft et 
al. state, on p. 326, first column, "The generality of our results for other crop 
species needs to be investigated." 

It is understood by those of skill in the art that backcross conversions are 
routinely produced and do not represent a substantial change to a variety. The 
World Seed Organization, on it's web site, writes, "The concept of an essentially 
derived variety was introduced into the 1991 Act of the UPOV Convention in 
order to avoid plagiarism through mutation, multiple back-crossing and to fill the 
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gap between Plant Breeder's Rights and patents." As determined by the UPOV 
Convention, essentially derived varieties may be obtained for example by the 
selection of a natural or induced mutant, or of a somaclonal variant, the selection 
of a variant individual from plants of the initial variety, backcrossinq, or 
transformation by genetic engineering. The commercialization of an essentially 
derived variety needs the authorization of the owner on the rights vested in the 
initial variety." International Convention for the Protection of New Varieties of 
Plants, as amended on March 19, 1991, Chapter V, Article 14, Section 5(c), 
(emphasis added). A copy of the relevant portion of the UPOV Convention and 
the World Seed Organization web site is attached as Appendix F . 

An example of how one of ordinary skill in the art can transfer a gene 
conferring a qualitative trait into a variety through backcrossing is demonstrated 
by the fact that the commercial market now distributes a multitude of products 
produced in this manner. Such conversion lines are easily developed without 
undue experimentation. Poehlman et al. (1995) on page 334, submitted in the 
information disclosure statement, states that, "A backcross-derived inbred line 
fits into the same hybrid combination as the recurrent parent inbred line and 
contributes the effect of the additional gene added through the backcross." 

The Examiner goes on to state that, "Eshed et al. teach that in plants, 
epistatic genetic interactions from the various genetic components comprising 
contributions from different genomes may effect quantitative traits in a 
genetically complex and less than additive fashion (see, e.g., page 1815). The 
Applicant would like to point out on page 1816, column 1, lines 1-5 of the Eshed 
et al. article it states, "Recent studies that detected epistasis of selected QTL in 
Drosophila (Long et al. 1995), soybean (Lark et al.1995) and maize (Doebley et 
al.1995; Cockerham and Zeng 1996) did not show a less-than-addit'rve trend." 
Emphasis added. Applicant also adds that transferring a qualitative trait does 
not require undue experimentation. Please note Hallauer et al. (1988) on page 
472, submitted in the information disclosure statement, which states, "For single 
gene traits that are relatively easy to classify, the backcross method is effective 
and relatively easy to manage." Claim 51 has been amended to expedite 
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prosecution. In claim 51, the genes transferred into PH5WB are now limited to 
the traits of herbicide resistance, insect resistance, disease resistance, male 

sterility, and waxy starch. 

■me Examiner alleges that Applicant's traversal of the art rejection on 
pages 7-6 of the amendment of 12 August 2002 "admits that outcrossing the 
exemplified inbred to another undisclosed plant is unpredictable." Applicant's 
statements on pages 7-8 of the amendment of 12 August 2002 make the point 
that it is not possible to use the cited prior art line F361 to produce a line that is 
the same as PH5WB or any of the claimed progeny of PH5WB. Applicant did 
not in any way state that it would be unpredictable to introgress a gene into 
PH5WB through backcross breeding techniques or transformation. 

In light of the amendments to the claims and the foregoing arguments the 
Applicant requests reconsideration of the rejection under the first paragraph of 
35 U.S.C 112. 



Claim Rejections under 35 U.S.C. § 102 and 103 

The Examiner states that, "Claims 14 and 43 remain rejected under 35 
U.S.C. 102(b) as anticipated by or in the alternative, under 35 U.S.C. 103(a) as 
obvious over Dowden (U.S. Patent No. 5,880,339)." Applicant traverses the 
rejection. 

Applicant has cancelled claim 14. Applicant has amended claim 43. 
Claim 43 is limited to progeny produced by the method of claim 42, which 
requires the use of PH5WB, and is further limited to progeny deriving at least 
50% genetic contribution from PH5WB. 

The Examiner has made several objections which relate to written 
description and these were addressed in the response to the written description 
section above. The Examiner then makes the statement that "It is unclear and 
unlikely that the genetic material that remains in the first generation progeny 
retain the unique genetic and morphological complement of the exemplified 
inbred... Furthermore, since the individual traits exhibited by the exemplified 
inbred are not unique to the exemplified inbred, either by degree of expression 
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nor genetic means of inheritance, a descended of the exemplified inbred 
containing no genetic material from it could still exhibit these individual traits, and 
could still possess the same genetic means for conferring these traits as did the 
exemplified inbred." 

Applicant responds that such a described inbred is not within the scope of 
the claim as amended, because it would not have at least 50% genetic 
contribution from PH5WB. Further, the benefit of the invention relates not to 
plants comprising these traits in general, but plants that comprise these traits 
through the use of PH5WB and the work of Applicant in fixing these traits and 
linkage groups in PH5WB. Nothing in claim 43 would restrict a breeder from 
making, using or selling a line with all of the same morphological and 
physiological characteristics of PH5WB or the plant of claim 43, so long as such 
plant was created without the use of PH5WB. 

Further, as evidenced by the declaration of Stephen Smith submitted as 
Appendix G. both PH5WB and its progeny within the scope of claim 43 are 
distinct from F361 taught in U.S. Patent No. 5,880,339. 

In light of the above, Applicant respectfully requests that the Examiner 
reconsider and withdraw the rejection to claims 1-49 under 35 U.S.C. 102 (b) 
and 103(a). 

Claims 1, 2, 4-10, 15-16, 21, 23-29, 37-43, and 50-57 are now pending in 
the application. The amendments made herein do not in any way change the 
claim scope which the Applicant believes is allowable but is meant to hasten the 
issuance of the patent. 

CONCLUSION 

Applicant submits that in light of the foregoing amendments and the 
remarks, the claims 1, 2, 4-10. 15-16, 21, 23-29, 37-43, and 50-57 are in 
condition for allowance. Reconsideration and early notice of allowability is 
respectfully requested. If it is felt that it would aid in prosecution, the Examiner is 
invited to contact the undersigned at the number indicated to discuss any 
outstanding issues. 



Respectfully submitted, 
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Abstract. The backer^ breeding procedure ha* been used wider/ to transfer simply Inherited train Into elite genotype;. 
Genetic markers ctn increase the effectiveness of backc ratling fay 1) increasing the probability of gbtainfag a suitable 
conversion, and 2) decreasing the time required to adtfcTe en Acceptable recover;, Simulation and field results indicated 
that, for a genome consisting often 100-cM chromosomes, basing selection on 40 or 80 markers In 50 BC Individuals thai 
carry the allele being transferred can reduce the number of backcross generations needed from abpat scrcn to three 



The baekcross breeding procedure has been used widely 
to trtnsfer simply inheriied traits into elite genotypes. 
Usually, the ualc being irinsfcrred is controlled by a 
single gene, buthighly heritable uaitt mat arc more complexly 
inherited have also been transferred successfully by baeke reus- 
ing; Cor example, maturity in maize (RJnte and Seniz, 1961; 
Sbever, 1976), Today, baefcerosjing is being nred to transfer 
genes introduced by such techniques *s b^nsfgcmatloa or 
mutation into appropriate gennplam 

Several plant breeding textbooks give good descriptions of 
the bacfccroH procedure GUIard, I960; Fehr, 1987). A donor 
parent (DP) w^ngatrilEOfintefestiscroi^dtothAitcuTrCTt 
parent (llP), an clita line that Is lacking Che trait The F, Is 
crossed back w the RP to produce the BC t generation. In the 
BC, and subsequent backcross generations, selected individu- 
als carrying the gene being transferred are backerowed to die 
RP, The expected proportion of DP genome is reduced by half 
with each generation of bickcrosslng. Ignoring effects or link- 
age to tba selected DP allele being transferred, the percentage 
recurrent parent (%RP) genome expected in each hackexois 
genoradon ii e&leutatedl ae: 

%R*»lOO[l-(0.S)' l ♦ , ] 

where n is the number of backcrassea. 

Backcrossing Of ^elected plants to the RP can be repeated 
each cycle until a line is obtained tfaatls essentially a version of 
the RP that includes the irixogreased allele. After six back- 
crosses, the expected recovery is >99% (Table I). 

Until recently* discussions of thoiecbvery ofiheRP genome 
during backcrossinghsve emphasised the axpecced values for 



Analysis ofMoUcularMarkir D<ua 



%KP shown in Table 1, and have largely ignored ihe genetic 
variation for 9bRP that exists around the expected mean. With 
the development of genetic markers capable, of providing good 
genome coverage, there has been interest In taking advantage of 
dial vhriatian to jncrca&B the efficiency of backcrossing. 

Selection for RP marker alleles can Increase greatly Che 
efl^venesstfbactaoss programs by aJlowmg the breeder to • 
1 ) select backcross plants that have a higher proportion of RP 
genome, end 2) select bacVeross Individuals that are better 
conversions near a mapped donor allele being transferred (Le,, 
seleet for less linkage drag), Expressed in practical terms, using 
generic marker* to assist backcrossing can 1) Increase the 
probability of obtaining a sukahia conversion, and 1) decrease 
the time required to aehinve Pui acceptable recovery. 

Issues to consider when planning a marker-assisted back- 
cross program Include t ) the time advantage of using markers 
to assist backcrossing. 2) the number of markers needed, and 3) 
the number of genotypes CO evaluate, In this report, we use 
results from previous literature, computer simulation, and em- 
pirical studies to provide some guidelines. 

Table 1. toptc f«rf Wflwry o/Mwrittr jwwir f *4om* aurin^ 
badtawiiif, mhwiMj m Untogt to ihi gvu MAg o^mjarrgd. 
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lUtr 



BC, 
BC* 
BC, 

"ft 



50.0000 
75.0000 
87J00O 
93.7S0O 
96.&750 
98.4375 

99,21 aa 

99,6094 
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j4aterbh and methods 

The maize genome was [he model for the simulation, the 
• Kimulatedgenom^conulnedtca^OO^Mchronio&omes. Simvl* 
lotion of crossing aver was based on a Pulsion dlstrlbu tion with 
i mean of X0 (\ * 2) (Hanson. I939J. which, on average, 
generated One Croaa over for every 1 00-cM length, The simula- 
tions teported hot assume no interference Codominant ge« 
' actio markers were evenly distributed in the genome and site* 
0f the donor gene were randomly aligned to genome locations. 
1 Simulations were conducted with [he following parameters: 

Number of progeny: 100 or 500. 

Bacfcetw generation* BC,,BCj, and BC,. 

Number Of marker: 20, 40, 80. or 100. 

Number selected co form the next BC generation: I or 5. 

Selection wis based on l)prwenc* of the donor allele and 2) 
high %RF), %t& was calculated « the average ef the (one or 
five) selected individuals. Values presented are the mean of 50 
1 rimulatioas. 

In the computer aimnlation study, all methods modeled 
>, greatly increased the speed of recovering (he RP genome 
, compared to ute expected recovery with no marker- agisted 
y selection (compare Tables 1 md 2). At least SO markers ware 
. required to recover 99% of the BP genome in jvat three BC 
generations (Table 2), Uw* of at least 80 marker* and 500 
■ progeny allowed recovery of RP la Just wo BC genera- 
tions. Response to selection was diminished only slightly by 
spreading the effort over five selections. Using markets, the 
• number of backer oss generations needed co convert an inbred 1 5 



Tibti & Perwni recurrent parent genome dtirinz mnrhrairistcd ba dam ul*i 



reduced from about scvan to ihrefc. 

By the BC, generation, there appear? w be no practical 
advantage to using 500 v S . 100 individuals. If me presence of 
the donor trait In the backcross individuals can be ascertained 
befo» > ifvktfs i are genoryped, (hen only half the number of 
individuals indfeaftd in the tables win need to be analyzed. 

When a small number of markers arc used, they quickly 
became non-informative; I.e., selection causea the marker loci 
to became fined for the RP iype before the lest of the EeoOrtl* 
Is fully cbnveTted(Tablc3;Hwplta| eta!., l992).Thissicuidon 
was most promineat in the larger populations, whew i higher 
selection intensity placed more selection pressure upon the 
marker loci. Accordingly, it is of interne k> Consider how 
closely the estimation of %RP based on matter* rcflcctj ihe 
actual geaome composition. The combination of estimation of 
ftRPbBSedon fewer markets and subsequentselectioa tends to 
bias thejestimates upward (compare Tables 2 and 3). 

The f'csulis from the simulation compare well with real field 
data. IftirypicalexampIe.SOBC^IantScarrylngthegrtie being 
transferred were geaofypod at 83 polymorphic RFLP loci (note 
that thid corresponds to a population size of 100 unsetected 
plants in Tabic* 2 and 3). The five beat BC recoveries bad 
estimated %RP valuei of 8£9* r 42.7*, S2.Q%, 81.4ft, and 
SL2*. After evaluating 10 BC^plaaXS from each lelectedBC,, 
(he bcst|fiC z recovery had an estimated %RP of 94.6%. 

Diseiission 

The simulations (Tabic 2; Hospital ec aL, 1992) and our 
axparleeea indicate mat fourmarken per200-cM chromOiOme 
is adequate to greatly increase the effeciivefless of selection in 
the BC lf However, using only four markers per 200 cM will 
likely make it very difficult to map the location of ma gene of 
Interest. Adequate tfummariaation of the data is an importam 
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; p3fT of a markcr»assisled backeroas program. Ideally , the mark- 
ers usedcau supply data that can be represented as iflelesofloa 

' «ltb known map position. Estimation of %RP, mappin g the 
position of the locus of interest, and graphite) display of (he 
results (Young and Tankslcy, 1969) ere all useful in imder- 
nanding end controlling the specific baekenm experiment 

■being Conducted, 

It appear* that, with the use of generic markers, the portion 

« of the KP genome thai Is not linked to (he allele being trans- 

• .jfcned can be recovered quickly and with confidence. The 
recovery of RP will bo slower on the chromosome carrying the 

'.gene of interest A conudcnhle amount of linkage dag is 
' expected to accompany selection for the DP allele in a back- 
» iross program. For a locus located in the middle of a 200-cM 
cfarofflosome, the length Of the DP chromosome segment ao- 
\ cornfanying selection \s expected to bo 126, 63, and 28 cM in 

* <ho BC, BC J( and BC T generations, respectively (Hanson, 
' j959;tiAVciraandBarbadilla, 1992). Our observations support 

the recommendation of Hospital ec id, (2992) that prof a fence be 
,* given (o the selection for recombinants proximal to the allele of 
1 ^interest, bat thai selection for recovery of the RP elsewhere In 
(be genome also be considered. Hri* two-ctaga selection can 
probably be done quite effectively ad hoc by the breeder once 
; the data is adequately summarized; however, Hospital ct al. 



suggest ways to Incorporate the two criteria into a selection 
index 5uch that each component of selection Is assured appro- 
priate weighting. 

Use oogenetic markcrscan greatly increase theeffectiveneSfi 
Of baefcerussmg, and they should be used in any serious back- 
crossing program if resources are available to the breeder 
t 
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Attorney Docket No. 1364 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant: Mark David Hoffbeck Date: March 1 1 , 2003 

Serial No.: 09/758,790 Group Art Unit: 1 638 

Filed: January 1 1 , 2001 Examiner: David T. Fox 

For "INBRED MAIZE LINE PH5WB" 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION 
OF 

.DR. STEPHEN SMITH 

Sir: 

l f Stephen Smith, PhD., do hereby declare and say as follows: 

1 . I am skilled in the art of the field of the invention. I have a Ph.D. in Biochemical 
Systematics and Taxonomy of Maize and its Wild Relatives from Birmingham University. 
I have a M.Sc. in the Conservation and Utilization of Plant Genetic Resources from 
Birmingham University. I have a Bachelor of Science degree in Plant Sciences from 
London University. Since 1977 1 have been engaged in the development, study and 
application of molecular markers to genetics, measuring genetic diversity and tracking 
pedigrees. I commenced this work at North Carolina State University as a post-doctoral 
research fellow. I have continued my engagement in these studies during my 
employment by Pioneer Hi-Bred from 1980 until the present. These studies have 
resulted in numerous scientific articles that have appeared in peer reviewed scientific 
literature. 

2. I have read and understood the Office Action in the above case dated October 
11, 2002. This declaration is in response to the Examiner's rejection under, 35 U.S.C, 
§112, first paragraph, as containing subject matter which was not described in the 
specification in such a way as to reasonably convey to one skilled in the relevant art that 
the inventor(s), at the time the application was filed, had possession of the claimed 
invention. 
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3. I have conducted an analysis of Simple Sequence Repeat, SSR, marker data for 
base inbred PH5WB and two backcross conversions of PH5WB. One backcross 
conversion of PH5WB has the trait of mate sterility and the other has the trait of waxy 
starch. 

4. The SSR data for 457 base inbreds and 103 backcross conversion inbreds, 
including PH5WB and the two backcross conversions were used in the analysis. The 
number of SSR markers for each inbred used in the analysis was between 15 and 87 
(mean of 82 ). The analysis was done as specified in the publication by Berry et al. 
("Assessing Probability of Ancestry Using Simple Sequence Repeat Profiles: 
Applications to Maize Hybrids and Inbreds" Genetics 161:813-824, 2002), with 
modification as described in Berry et al., ( 2003 ); Assessing Probability of Ancestry 
Using SSR Profiles: Application to maize inbred lines and soybean varieties. Genetics 
(in review), a copy of which is attached hereto. 

5. The results of the analysis indicated that through the use of SSR markers 
PH5WB was identified to be the recurrent parent of each of the two backcross 
conversions of PH5WB over all the other inbreds in the data set. The probability 
associated with the identification of PH5WB as the recurrent parent of each backcross 
conversion was calculated as 0.99. 

6. ! hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false statements 
and the like are punishable by fine or imprisonment or both, under Section 1001 of Title 
18 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 

Date: 

Stephen Smith 
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ABSTRACT 

Determining parentage is a fundamental problem in biology and in applications such as 
identifying pedigrees. Difficulties inferring parentage derive from extensive inbreeding within 
the population, whether natural or planned; using an insufricient number of hypervariable loci; 
and from allele mis-matches caused by mutation or by laboratory errors that generate false 
exclusions. Many studies of parentage have been limited to comparisons of small numbers of 
specific parent-progeny triplets. There have been few large-scale surveys of candidates in which 
there is no prior knowledge of parentage. We present an algorithm that determines the 
probability of parentage in circumstances where there is no prior knowledge of pedigree and 
which is robust in the face of missing data and mis-typed data. The focus is parentage of an 
inbred line having uncertain ancestry. The algorithm is a variation of a previously published 
hybrid-focused algorithm. We describe the algorithm and demonstrate its performance in 
detenruning parentage of 43 inbred varieties of soybean that have been profiled using 236 SSR 
loci and from seven inbred varieties of maize that were profiled using 70 SSR loci. We include 
simulations of additional levels of missing and mis-typed data to show the algorithm's utility and 
flexibility. 
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The determination of parentage using molecular marker data has been little addressed for 
situations where there is little or no prior knowledge of parentage, or when large-scale surveys 
involving numerous candidate parents are required. Consequently, we have recently developed 
an algorithm and demonstrated its use in determining probability of parentage for hybrids in 
circumstances where there is no prior knowledge of pedigree and which is robust in the face of 
missing or mis-typed data (Berry et al 2002). We now present a variation of this algorithm that 
allows determination of parentage for inbred lines or homozygous varieties. 

We describe and evaluate a methodology that quantifies the probability of parentage of 
homozygous genotypes. Our algorithm takes into account that generations of self-pollination 
occur after the initial parental cross. The number of generations and the initial parental genotypes 
are unknown. Each generation of inbreeding reduces the number of heterozygous loci in the 
progeny by an average of 50%. Thus, each of the inbred progeny individuals resulting from the 
initial parental cross will have lost approximately half of the parental alleles for loci where the 
inbred parents were fixed for alternate alleles and which were heterozygous in the Fl generation. 

The loss of parental alleles during the inbreeding phase is in contrast to the case of a hybrid 
progeny. An inbred progeny individual will exhibit a lower level of allelic similarity to either of 
its inbred parents than a hybrid progeny will to its inbred parents. This loss of some parental 
alleles during inbreeding might be expected to make an inbred algorithm less robust in the face 
of missing or mis-typed data compared with the hybrid algorithm that has been previously 
described (Berry et al 2002). We therefore demonstrate the effectiveness and robustness of the 
inbred algorithm using examples from two species of cultivated plants. We first tested the 
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algorithm using varieties of the naturally self-pollinating, inbred crop, soybean [Glycine max. 
(L.) Merr.]. This crop was selected because numerous varieties of soybean with known pedigrees 
were available to us, many of which are closely related* We also used publicly bred inbreds of 
maize (Zea mays L) that are of known pedigree. Maize is naturally an outcrossing species but 
inbred lines are most usually generated for use as parents of commercial hybrids. Inbred lines are 
generated by making successive generations of self-pollination following the initial bi-paxental 
cross. 

MATERIALS AND METHODS 

Algorithm: The algorithm is a variation of the hybrid version of Berry et al (2002). Consider an 
index inbred whose parentage is unknown or in dispute. A database containing possible inbred 
ancestors is available. The objective is to find the probabilities of closest ancestry for each inbred 
in the database using genotypic information from a large number of SSRs. 

Consider a pair of possible ancestors, inbred / and inbred j. We calculate the probability that 
inbreds i and; are in the index's ancestry, repeating this for all pairs of inbreds in the database. 
Let P(iJ\SSRs) stand for the posterior probability that i and j are ancestors of the index given the 
information from the various SSRs. Let P(ij) stand for the unconditional (or prior) probability of 
the same event and let P(SSRs\i,f) be the probability of observing the various SSR results if in 
fact / and j are ancestors of the index. Just as in Berry et al (2002), Bayes' rule relates these 
various probabilities: 
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P{ij\SSRs) = P(SSRs\iJ)*P(hJ) ' IiP(SSRs\u,v)*Piu.v)], 

where the sum in the denominator is over all pairs of inbreds in the database, indexed by u and v. 
We need to calculate P{SSRs\i,j) for each i and;. We will make the "no-prior-informatioii" 
assumption that P{iJ) is the same for all pairs (ij). Then P{u, v) is a common multiple in the 
denominator that cancels with P(ij) in the numerator 



PXtJ \SSRs) = P{SSRs\iJ) I If{SSRs\u,v). 

The problem is to calculate a typical P(SSRs\ij), the probability of observing the index's SSRs 
assuming inbreds / and; are both ancestors. The nature of breeding before the self-pollination 
process is unknown. Since the creation of an inbred proceeds by multiple generations of self- 
pollination on a hybrid, we label the (unknown) hybrid used to create the (known) index inbred 
as the intermediate hybrid. When the intermediate hybrid is an immediate descendent of i and;', 
it receives one of inbred i's alleles and one of inbred;'s alleles. When the intermediate hybrid is 
a second generation descendent of i and;', it receives one allele from each with probability 0.5. 
And so on. Since degree of ancestry (if any) is unknown, we label the actual probability of 
passing on one of these alleles to the intermediate hybrid to be p. As in Berry et al. (2002) we 
consider /> = 0.50 andp = 0-99 and here we also consider the intermediate value p = 0.75. 

When inbreds / and ;' are ancestors then there are four possibilities: (1) the alleles of both / and; 

were passed to the intermediate hybrid, (2) i came through but not;, (3); came through but not i, 

z 

and (4) neither came through. Assuming independence, these have respective probabilities/? , 
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p{l-p) t p(\-p), (\-pf. An allele in the inteimediate hybrid's genotype that did not arise ftom 
either inbred i or inbred J is assumed to be selected with probability where n is the total 
number of alleles at the SSR in question. So far the steps we have described are identical to those 
for identifying the ancestors of a hybrid described by Berry et al (2002) and, in fact, if the index 
is heterozygous at an SSR then calculations proceed just as for hybrids. Calculations are 
substantially different when the index inbred is homozygous, say genotype aa. Cases that must 
be considered are shown in Table I, where* is any allele different from a (but not missing). All 
alleles other than a can be grouped because only a appears in the index's genotype. For example, 
xx might be be or bd or bb. 

P(SSR\iJ) is the probability of observing the index assuining inbreds i and; are ancestors. The 
calculations for SSRs 1 to 6 are shown in Table 2, where the four terms in each case are in order 
of (2), (3), (4) defined in the previous paragraph. Missing alleles are not considered in the 
examples above. The number of possibilities is large. Here we consider only the case in which 
inbred i is aa and both alleles of inbred j are missing. Then 



P(SSR\ij)=p 2 (l/2H/2*i/n)+p(l-p)(l/2+lM*l/2) + p(l-p)(l/n) + (l-pf(lfn) 



Another possibility not considered above is that more than two alleles can be observed for an SSR 
marker run on individual DNA sample. This can be due to SSR locus duplication, homology due to 
alloploidy, more than one individual plant being sampled for DNA extraction or cross-contamination. In 
this case we consider all possible pairings of the observed alleles and calculate using a multiple 
imputation procedure (Little and Rubin, 1987). 
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To find the overall P(SSRs\iJ), multiply the individual P(SSR\ij) over the various SSRs. To 
determine the probability that any particular inbred, say inbred /, is the closest ancestor of the 
index, sum P(SSR\i,v) over all inbreds v with v Call this P(i\SSR$). The maximum of 
P(i\SSRs) for any inbred / is 1 . But since there is one closest ancestor on each side of the family, 
the sum of P(i\SSRs) over all inbreds / is 2. 

SSR data: Soybean DNA was extracted from 490 varieties, all of which were bred in, and are 
adapted to, the United States. Plant material for DNA extraction was sampled from six plants of 
each variety. Most of the varieties are proprietary products of Pioneer Hi-Bred International. 
Several (non-patented) commercial varieties from other breeding companies and some important 
publicly bred varieties were also included. Procedures for obtaining SSR data from soybean were 
identical to those described for maite by Berry et al. (2002) apart from the following 
modifications: PCR products with different size ranges and labeled with different fluorochromes 
were pooled and diluted 1 :9 with capillary electrophoresis buffer (Applied Biosystems) then 1 :4 
with dH20. 1.5ul of pooled DNA were added to lOul formamide containing the molecular weight 
size standard 400HD ROX (Applied Biosystems, ROX - 6-carboxy-X-rhodamine). Fragment 
separation was performed using capillary electrophoresis on an ABI3700 platform (Applied 
Biosystems), with an injection time of 10 sec at 10,000 V and a run time of 4,000 sec at 7,500 V. 
Forty-three soybean varieties that had both of their parent varieties also included in the dataset 
were assigned as index varieties, One to two and occasionally three grandparent varieties of 
several of the index varieties were also included in the dataset. These varieties collectively 
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represent a broad array of diversity of soybean germplasm that is currently grown in the United 
States. 

Two hundred and thirty-six publicly available soybean SSR markers 
(http://soybase.agron.iastate.edu/) were used to demonstrate and evaluate the algorithm. These 
SSR markers were selected following initial screens on a subset of 24 soybean varieties in which 
they were tested for amplification and the ability to detect polymorphism. The 236 markers gave 
good genome coverage and collectively mapped across each of the chromosomal linkage groups 
of soybean. 

All allele scores were made without knowing the identities of the soybean genotypes. 

Maize SSR data using 70 loci were previously reported by Senior et al (1998) and were obtained 
directly from the first author. This publication (Senior et al 1998) cites an array of 94 
historically important publicly bred lines that have well known and well established pedigrees. 
This array of public inbreds includes seven inbreds (A632, A634, Mol7 s Pa91 , Va35, Va99 and 
W64A) that each have SSR profiles for their parental lines included in the same dataset. Three of 
these inbreds were developed from a breeding cross of two unrelated parents. These are: Mo 1 7 
which was bred from the cross of C.L 187-2 x C 103; Va99, which was bred from the cross 
Oh07B x Pa91; and W64A which was bred from the cross of WF9 x C.I. 187-2. Other inbred 
progeny had more complex pedigrees. One inbied (Va35) was bred from the cross C103 x T8 
following an additional cross of T8 as the recurrent parent. Two inbreds (A632 and A634) were 
bred from the cross Mt42 x B14 following additional crosses of B14 as the recurrent parent. 

9 
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Pa91 was bred from a complex cross involving four inbreds (WF9 x Oh40B) and (38-1 1 x 
L317). These seven progeny inbreds therefore provided an index set of maize inbreds for 
evaluation of the inbred algorithm. 



Data quality: The soybean SSR data that were used to evaluate the algorithm had a mean of 
5,5% (range 0-19% loci) missing data per variety. For parent-progeny triplets, there was a mean 
of 1 . 1 % loci (range 0-5%) where a progeny profile was scored for an allele that was not 
represented by either of the seed sources that represented the parents. The maize SSR data had a 
mean of 0 .7% missing data (only three genotypes had missing data; these were at elevated levels 
of 5%, 9%, and 36%). A mean of 6.4% parent/progeny triplets (range 4-7%) had SSR progeny 
profiles that did not share an allele with either of the seed sources that were available to represent 
the original parental genotypes. 

Probability of ancestry applied to soybean data: Figures 1 and 2 present the probabilities of 
closest ancestry of the top ranking varieties for each of 43 soybean varieties using data from 236 
marker loci atp = 0.50 (Fig 1) and at p » 0.99 (Fig 2). 

When the algorithm was used at p = 0.5 with data from all 236 loci (Fig 1), then 24/43 (56%) of 
index varieties had both parents correctly identified in the top two ranked positions, 12/43 (28%) 
had one parent correctly placed in one of the top two positions, and 7/43 (16%) had none of the 
actual parents assigned into the top two ranked positions. Thus, when p = 0.5 was used, 60/86 



RESULTS 
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(70%) of actual parental varieties were correctly ranked in the top two positions and 26/86 (30%) 
were incorrectly placed in lower positions. 

When the algorithm was used at p = 0.75 with data from all 236 loci (data not shown), 28/43 
(65%) of index varieties had both parents correctly identified in the top two ranked positions, 
1 1/43 (26%) had one parent correctly placed in one of the top two positions, and 4/43 (9%) had 
none of the actual parents assigned into the top two ranked positions. Therefore, whenp = 0.75 
was used, 67/86 (78%) of the actual parental varieties were correctly ranked in the top two 
positions and 19/86 (22%) were incorrectly placed in lower positions. 

When the algorithm was used atp = 0.99 with data from all 236 loci (Fig 2), then 33/43 (77%) of 
actual parental varieties were correctly ranked in the top two positions and 10/86 (23%) had one 
parent correctly placed; all index varieties had at least one parent ranked in the top two positions 
when the algorithm was used atp = 0.99. Withp used at 0.99 then 76/86 (88%) of actual parental 
varieties were correctly assigned; 10/86 (12%) were incorrectly assigned. 

Table 3 presents the rankings, probabilities, and pedigrees of varieties that were incorrectly 
assigned above a true parent. The largest pedigree class (41% of cases where a non-parent ranked 
above a true parent) of non-parents ranking higher than parents was for varieties that are 
derivatives of the parent that was misplaced at a lower ranking. The equal second largest classes 
(each representing 14% of the cases) were for varieties that were (a) full sibs of the true but 
misplaced parent and (b) full sibs of a grandparent of the variety for which the pedigree was 
being tested. Other categories (percent of cases in parentheses) were: multiple backcross versions 
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of the misplaced parent (7%), a derivative of the variety or which the pedigree was being tested 
(7%), a half-sib of the true but lower ranked parent (7%), a full sib of the variety for which the 
pedigree was being tested (3%), and a half-sib of the variety for which the pedigree was being 
tested (3%). Insufficiently detailed pedigree information is available to categorize one variety 
(3% of cases) that ranked above the true parent 

Robustness: The quality of soybean SSRdata as received from the laboratory, in terms of 
missing data and apparently non-Mendelian parent-progeny triplets, have already been F «ented. 
Taking these data as an initial starting point, additional levels of missing and mis-typed data 
were created by simulations and used to explore robustness of the algorithm. 

SSR data for five index soybean varieties were used to determine the robustness of the algorithm. 
Subsets of data were created that included parameters of reduced numbers of loci, additional 
levels of missing data, additional levels of mis-typed data, and various combinations of these 
parameters. Simulated levels of missing and mis-typed data were created with a first pass 
creating missing data, followed by a second pass creating mis-typed data. Therefore, for 
example, the maximum level of cumulative error from simulated missing and mis-typed data was 
from 36 to 40%. Five varieties were chosen to represent a range of diversity in respect of both 
pedigree and SSR profiles. Four varieties had no parents or grandparents in common and one 
pair of varieties was related by a common parent. All varieties had parents ranked in the top two 
positions when the algorithm was run at/> = 0.75 and/> - 0.99. This selection of varieties 
therefore provides a means to establish lower boundaries for both the quantity and quality of 
SSR data that are required to avoid aberrant results. 
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Table 4 presents the probability of ancestry of the top five ranked varieties for each of five 
selected soybean index varieties (93B11, A7986, P9443, S38T8 and Young) when the algorithm 
is run using different numbers of SSR marker loci (50, 100, 150 and 236) at each of two levels of 
p (0.5 and 0.99). Usingj? = 0.5, the lowest percentage of patents (60%) that were correctly 
ranked into the top two positions corresponded to using only 50 SSR. Increasing the number of 
loci to 100 or 1 50 or 236 increased the ability to identify the actual parents to about 90%. When 
p was used at a level of 0.99 all parents were correctly ranked into the top two positions for each 
of the five varieties when data from as few as 50 SSR loci were used. 

Table 5 summarizes other aspects of robustness. Namely, we simulated additional levels of 
missing, mis-typed and missing plus mis-typed data, beyond those that were inherent in the data 
as provided by the laboratory. When p was used at a level of 0.5, robustness was generally 
maintained up to an additional level of 20% simulated missing data, so long as data from 100 or 
more loci were used. Similarly, robustness was maintained for up to 20% additional mis-typed 
data so long as data from 100 or more loci were used. Likewise, robustness was maintained with 
up to 1 8 to 20% additional levels of data error including both missing and mis-typed data, so 
long as data from 150 or more loci were used. Using data for all 236 loci provided a higher level 
of robustness, but even then robustness collapsed when 36 to 40% cumulative additional error 
from missing and mistyped data were simulated into the analysis. The overall level of correct 
assignation of parent varieties was higher when p was used at a level of 0.99. All parents then 
were correctly identified, even when data from only 50 loci were used up to an additional level 
of 10% missing data. When data from 100 or more loci were used then all parents were correctly 
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identified with up to 20% additional missing data. Robustness started to decline when the 
algorithm was applied with 10% additional mis-typed data when data from 150 or fewer SSR 
loci were used. However, robustness was maintained for up to 20% additional mis-typed data 
when data from 236 SSR loci were used. When additional levels of both incorrect data were 
applied then robustness was maintained at levels of up to 10% missing plus 10% mis-typed data 
so long as data from at least 150 SSR loci were used. Robustness was compromised when 
additional simulations of 20% missing plus 20% mis-typed data were applied even when data 
from all 236 SSR loci were used. 

We then investigated the relationships of varieties to the index genotype whose pedigree was 
under examination by rerunning the analysis after both parents of the index genotype had been 
removed from the analysis. Fifteen varieties that had two or more of their grandparents profiled 
in the dataset were used for this examination. After removing parents, direct pedigreed 
derivatives of the index genotype ranked first for P9583, in the first three places for A2943 and 
in the first six places for P9561. Once all parents and derivatives of the index genotype had been 
removed from the analysis then the following results were obtained. Predominant classes of 
varieties ranking in the top five positions were (percent of cases in parentheses): derivatives of 
the grandparent of the index variety (32%), grandparents of the index variety (16%), derivatives 
of the parents of the index variety (16%), and half-sibs of the index variety (13%). Grandparents 
ranked among the first four positions for 10 varieties and were in the first place for five varieties. 
Great-grandparents ranked within the first seven places for three varieties, and a great-great- 
grandparent ranked in eighth place for one variety. Other varieties that ranked in the first place 
were usually closely related to the variety whose pedigree was under examination; full-sibs and 
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half-sibs were the predominant classes of relatives other than grandparents in the first ranking 
position after parents and direct derivatives of the variety under examination had been removed. 

Probability of ancestry applied to corn data: The seven index inbreds of maize were selected 
because they represented all of the inbred l ines published upon by Senior et al. (1998) that had 
all of their inbred parents also included in the SSR dataset. All of the inbred lines published by 
Senior et al. (1998) have well known and well established pedigrees that are fully provided by 
those authors. 

Table 6 presents probabilities of ancestry for the top five ranked inbreds for each of the seven 
index inbred lines at two levels of p (0.5 and 0.99). For the three progeny that were bred from 
single crosses without any subsequent use of one of the parents to make a recurrent cross prior to 
inbreeding (Mol7, Va99, and W64A) then use of the algorithm at either/? = 0.5 or at£ = 0.99 
resulted in the parental inbreds being ranked in first and second positions. Use of the algorithm at 
p = 0.99 provided greater discrimination for probabilities of ancestry that were assigned to actual 
parents compared to highest ranking non-parents. This was most noticeable for the case of inbred 
Va99 which had a relatively low value when used at p = 0.5 for parent 2 (0.5221) compared to 
parent 1 (0.9999) or to the third ranked inbred (and non-parent), Va22 (0.4252). In contrast, 
when the program was run at p = 0.99 then parent 1 and parent2 for Va99 had probabilities of I 
and 0.9855, respectively, with the probability of the third ranked inbred being 0.0131. 

For each of the three progeny inbreds that originated from breeding schemes that involved one or 
more additional crosses of one of their parents, using the algorithm &\p = 0.5 resulted in 
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placement of the respective recurrent parent with the highest probability of ancestry. Raising the 
level of p to 0.99 resulted in both parents (B 14 - recurrent parent and MT42 the non-recurrent 
parent) of the index inbred A632 being ranked in the top two places. Using this level oip also 
caused a higher ranking (third position) for the non-recurrent parent (MT42) of index inbred 
A634. Use of/> at 0.99 did not cause the non-recurrent parent (CI 03) of index inbred (Va35) to 
rank into the top five places. 

For the index inbred (Pa91) that was bred from a more complex cross involving four inbred 
lines, the use ofp at 0.5 or at 0.99 resulted in the two parents (WF9 and OU4QB) being ranked in 
second and third places; highest ranked was inbred Va99 (Va99 is derived from the index inbred 
Pa91). Neither of the two remaining parents of Pa9l ranked in the top five places. 



DISCUSSION 



The current widely used North American soybean varieties are founded upon a relatively narrow 
genetic base of diversity. Gizlice et at. (1994) document that the U- S. soybean germplasm base 
is founded upon 20 plant introductions and that subsequent breeding has made repeated use of 
related parents. Molecular marker comparisons of elite U. S. soybean varieties compared to a 
sample of exotic varieties reinforce the conclusion that there is a relative paucity of genetic 
variation in U. S. soybeans. Narvel et ai (2000) have shown that the number of alleles detected 
among the exotics was 30% greater than among U. S. varieties. Thompson and Nelson (1998) 
report that very little exotic germplasm has been incorporated into the existing U. S. soybean 
germplasm base. Examining all pairs of pedigree relationships among the 490 soybean varieties 
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employed in this study showed that approximately 50% of pairwise relationships are related at 
the level of half-sib or closer, approximately 10% of pairs are related at the level of full-sib or 
closer- This set of soybean varieties therefore provides the basis for an extremely rigorous 
evaluation of the ability of SSR data to distinguish between varieties and of this algorithm to 
identify pedigrees. Pedigree breeding, including the use of related parents, is also commonly 
applied in the breeding of maize inbred lines. The set of maize inbreds used here thus also 
provides a meaningful evaluation of the marker data to discriminate among inbred lines and of 
the joint ability of the algorithm and of the marker data to allow a determination of inbred 
pedigrees. 

Use of the algorithm at p = 0.99 rather than at a lower level improved performance in terms of 
the percentage of correct assignations of parents and provided a greater statistical differential for 
probabilities for parents in comparison to the highest ranking non-parents. Use of the algorithm 
atp = 0.99 is more appropriate when it is known that the actual parents of the variety under 
examination are included among the set of index varieties. If it is not known that the parents are 
included in the index set then use of the algorithm aXp = 0.5 is more justified (Berry et al. 2002). 
For the soybean varieties, when p was used at 0.99, then 77% of all varieties that were queried 
for their parents had both parents correctly identified. Eight-eight percent of soybean parents 
were correctly identified across 43 index varieties that were queried for their parents. All 
varieties (with the possible exception of one variety where detailed pedigree information was not 
available) that ranked above true parents were related either to the mis-ranked parent or to the 
variety that was being queried for its pedigree. Our previous report of the use of an algorithm to 
determine hybrid pedigrees (Berry et al. 2002) showed a higher level of correct parental 



17 



Received from < S15 334 $883 > at 3111103 4:07:36 PM [Eastern Standard Time] 



03/11/03 THE 16:22 FAX 515 334 6883 . PIONEER HI -BRED DSM @056 



deternunations Xp - 0.99. Many of these soybean varieties have a high degree of pedigree 
relatedness. However, many of the maize inbred lines that were used in the previously reported 
study (Berry et at. 2002) were also highly related. It is, however, likely to be inherently more 
challenging to correctly identify parents following cycles of inbreeding because half of the 
alleles that are segregating in the first generation following the initial breeding cross will be 
subsequently lost as recurring cycles of self-fertilization occur. Thus, many of the alleles that are 
present in a hybrid, and which can therefore contribute to the identification of its pedigree, do not 
remain present in an inbred homozygous progeny. 



We examined the pedigrees of soybean index varieties when both parents of the index had been 
removed from the set of candidate varieties. Direct pedigree descendents with the index variety 
as one parent then usually ranked higher than other varieties, including varieties that were 
grandparents or sister varieties of the index variety. When all parents and direct derivatives of the 
index variety were excluded from the analysis then the predominant classes of varieties ranking 
in the top five positions were derivatives of the grandparent of the index variety (32%), 
grandparents of the index variety (16%), derivatives of the parents of the index variety (16%), 
and half-sibs of the index variety (13%). The SSR data that were available to us did not allow a 
thorough or very precise assessment of how varieties with different degrees of relatedness would 
rank as members of the pedigree in the event that the true parents were not present in the 
database. Nonetheless, when parents were excluded from the analysis then varieties that were 
very closely related to the index variety ranked highest. Direct descendents dependent for their 
pedigree upon the index variety, if present, tended to rise above varieties included within other 
classes of pedigree relationship to the index variety. When varieties directly descended by 
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pedigree from the index variety were also excluded then a grandparent ranked into first position 
for 33% of the varieties that were examined. Direct pedigree derivatives of one or more of the 
parents of the index variety had an equal level of occurrence when parents and derivatives of the 
index variety were excluded. Further investigations of the identification of grandparents will 
require a dataset including all grandparents of each index variety and will also require a revised 
algorithm to take account of pedigree contributions from four varieties as opposed to pairs of 
varieties which forms the basis of the current inbred algorithm. 

For the maize inbred line pedigrees, use of the algorithm either aXp = 0.5 or tip = 0.99 resulted 
in the correct identification of both parents in all cases where the breeding scheme was an initial 
cross of two parental lines followed by subsequent cycles of inbreeding (i,e. for the inbreds 
Mol7, Va99 and W64A). The relatively high level of robustness for results with maize inbreds at 
p = 0.5, in contrast to the results obtained from analyzing soybean data (where 56% of varieties 
had both parents correctly identified when p = 0.5 was used) could be accounted for by the 
smaller sample size of maize inbreds and by the lower degree of mean pedigree relatedness 
amongst this selection of inbred lines in comparison to the soybean varieties. Thus while several 
inbred lines in this set are closely related, there remain many inbreds that have little or no 
pedigree relationship (Senior et al 1998). 

The inbred algorithm correctly identified both parents of the three maize index inbreds that had 
been bred from bi-parcntal crosses that involved equal contributions (by pedigree) from both 
parents. For the three bi-parental crosses that involved subsequent additional crosses of the 
recurrent parent (and thus significantly biased contributions by pedigree to the index variety 
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from the recurrent parent) then use of the algorithm correctly identified each of the recurrent 
parents. The algorithm was unable to identify the non-recurrent parent in most cases, but this 
result would be expected because one backcross reduces the expected pedigree contribution of 
the non-recurrent inbred to 25%. More generations of backcrossing using the recurrent parent 
then farther reduce the expected pedigree contribution of the non-recurrent parent by half at each 
generation (successively to 12.5%, 6.25%, 3.125%) with the pedigree contribution of the 
recurrent parent rising accordingly. Since several inbred lines of maize are related by pedigree 
then it is not surprising that the level of pedigree or SSR similarity of a non-recurrent parent to 
the index progeny can fall below other inbred lines that are related to the index variety. The 
algorithm was not able to preferentially identify parents of the inbred line Pa91, which was bred 
from a complex breeding scheme involving four parents with equal contributions by pedigree. A 
more suitable algorithm is needed to take account of four way crosses. However, such a need is 
primarily academic because most breeding crosses in commercial maize breeding, and indeed for 
most crops, are bi-parental. 

These soybean data had a mean of 5.5% missing data per variety and a mean of 1.1% loci where 
a progeny was scored with an allele that was not also scored in either or both parents. Such 
apparent non-Mendelian or exclusionary profiles can he due to pollen contamination during 
inbreeding, cross contamination in the field or laboratory, scoring errors in the laboratory (e.g. 
scoring +A, predominant stuttering, spectral pull-up, secondary binding sites or polymer spikes), 
or incorrect pedigrees. Another source of apparent exclusion is through the use of a seed source 
as a parent that is still heterogeneous due to inbreeding being incomplete. Cycles of inbreeding 
then continue so that when those seed sources are used in the future as sources for SSR profiling 
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to represent the parental genotype they will have lost alleles due to inbreeding that have already 
been passed on to a progeny. Alternately, residual heterozygosity within seed sources can result 
in low frequencies of heterozygotes or off-type segregants which may, by chance, be sampled in 
the progeny, but not sampled in the parent. In this study we sampled six plants to represent the 
variety which may be insufficient to capture alleles existing at low frequencies within the seed 
source. And even if the allele was sampled, it may not have been detected following PCR 
amplification due to predominance of the most frequent allele and allelic competition effects. 
Hall (2002) has also reported the occurrence of apparent non-parental SSR alleles. Mutation can 
also affect SSR profiles. Vigouroux et ai (2002) have estimated mutation rates of 7.7 x 10 4 per 
generation for dinucleotide SSRs and an upper 95% confidence limit of 5.1 x 10" s for SSRs with 
longer repeat units. A level of error or discrepancy in expected SSR profiles are thus inevitable 
for some, if not all crop plants. We therefore evaluated the robustness of the algorithm and 
dataset by rerunning the algorithm using datasets that were simulated to have up to 20% 
additional levels of missing plus 20% mis-typed data beyond the level that was received from the 
laboratory. The algorithm maintained its initial level of robustness with up to an additional level 
of 10% both missing and mis-typed data, provided data from at least 100 SSR loci were used- 
Fewer loci (60) were capable of retaining this degree of robustness in the evaluation of the 
hybrid pedigree algorithm using maize hybrids (Berry et al 2002). The loss of parental alleles 
that occurs during the inbreeding process, in contrast to their retention in a hybrid progeny 
compared to its parents, probably underlies the need to use data from a greater number of loci to 
maintain robustness for the inbred algorithm as compared to the hybrid algorithm. 
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It was anticipated that determination of pedigrees following cycles of inbreeding might be more 
challenging to accomplish than to determine pedigrees of hybrids where the total nuclear genetic 
contributions of both parents are preserved. Nonetheless, these results show that the algorithm 
can be used effectively to identifying parents of inbred genotypes. Nearly 90% of soybean 
parents were identified This is a set of genotypes which, due to the relatively narrow founder 
base and subsequent cycles of development through the use of related crosses, provides an 
extremely rigorous test of the algorithm and of the discriminatory power of the marker data. 
Supplementary data also show the capability of the algorithm to identify parents of maize inbrcds 
that have been developed in a pedigree system using two parents. Use of this algorithm with 
currently available codominantly expressed molecular marker data has also been shown to have 
practical feasibility because of the high degree of robustness that is evident and which extends 
well beyond the realm of aberrant or unexpected marker data that is encountered. These types of 
em>r or unexpected marker data can include laboratory error, sampling effects or the use of 
different seed sources for the actual parental source compared to a more inbred source that 
becomes available later to represent the parental genotype. This algorithm has application in a 
number of fields, including conservation biology, population genetics, and to assist in the 
protection of intellectual property rights. 
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Table 1. Calculations of ancestry for homo^gous index inbreds: Cases that must be considered 

for example of genotype aa. 



SSR 


Index 


Inbred/ 


Inbred j 


1 


aa 


aa 


Aa 


2 


aa 


aa 


Ax 


3 


aa 


aa 


Xx 


4 


aa 


ax 


Ax 


5 


aa 


ax 


Xx 


6 


aa 


XX 


Xx 



x is any allele different from a, but not missing 
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Table 2. Probability of observing the index [P(SSR\iJ)] turning inbreds i and; are ancestors: 
Calculations for SSRs 1 to 6. 



ssrH 


P(SSR\iJ) 


X 


p>(4/4) + p(l-p)(l/2+lfn*l/2) +p(l-p)(l/2+l/n*I/2) + (1-pfU'ri) 


2 


p J (3/4) + p(l-p)(l/2+l/n*l/2) +p(I-p)(I/2*l/2+l/n*l/2) + (1- 
pf(Hn) 


3 


~p J (2/4) +p(I-p)(l/2+I/n*l/2) +p(l-p)(I/n*l/2) + (1-pTO/n) 


4 


p J (2/4) + p(l-p)(l/2*l/2+l/n*l/2) + p(l-p)(l/2*l/2 + 1MV/2) + (I- 
pf(lln) 


5 


p l (U4) + P (l-p)(l/2*l/2+l/n*l/2) +p(l-p)(l/nV/2) + (l-pT(lln) 


6 


' p l (0!4) +p(l-p)(l/nV/2) + P (l-p)(l/n*l/2) + (l-pf(l/n) 



The four terms in each case are in order of the four possibilities when inbreds i and; are 
ancestors: (1) the alleles of both i and; were passed to the intermediate hybrid, (2) i came 
through but noty, (3); came through but not i, and (4) neither came through. Missing alleles are 
not considered. 
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Table 3. Probabilities of ancestry and pedigree relationships for soybean varieties where both 
parents did not rank above non-parents. 



Case no. Index variety Rank 

I 95B97 1 Parent 2 



Possible ancestor 



Probability 



A2943 



A4595 



Hark 



Kent 



P9583 



P9641 



S30J2 



2 Full sib of parent 1 

3 Parent 1 



1 
2 
3 
4 
5 
6 

1 
2 
3 
4 
5 
6 
7 
8 
9 
10 



Parent 1 

Derivative of parent 2 
Multiple backcross of parent 2 
Derivative of Parent 2 
HalfsibofA4595 
Parent 2 



Parent I 

Derivative 

Derivative 

Derivative 

Derivative 

Derivative 

unknown 

Derivative 

Derivative 

Parent 2 



of parent 2 
of parent 2 
of parent 2 
of Hark 
of parent 2 

of parent 2 
of parent 2 



1 Parent 2 

2 Derivative of parent 1 

3 Derivative of parent 1 

4 Parent 1 

1 Parent 1 

2 Full sib of P9583 

3 Parent 2 

1 Parent 2 

2 Derivative of P9641 

3 Parent 1 

1 Parent 1 

2 Derivative g f parent 2 



1 

0.5822 
0.4124 



1 Parent 1 0.9977 

2 Multiple backcross of parent 2 0.7999 

3 Parent 2 0.1999 



1 

0.9956 
0.0034 
0.0006 
0.0004 
0.0001 

1 
1 

2.1E-09 
1.4E-09 
3.1E-10 
1.1E-13 
3.8E-15 
4.6E-17 
4.7E-21 
2.7E-21 

1 

0.9990 
0.0011 
3.0E-04 

1 

0.8801 
0.1199 

1 
1 

3.7E-06 
1 

0.9321 
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10 



YB30K01 1 Parent 2 

2 Half sib of parent 1 

3 Full sib of parent 2 

4 Halfsibofparent2 

5 Full sib of grandparent 

6 Derivative of parent I 

7 Full sib of parent 2 

8 Full sib of grandparent 

9 Parent 1 

YB41Q01 1 Parent 2 

2 Full sib of parent 1 

3 Full sib of grandparent 

4 Full sib of grandparent 

5 Parent 1 



1 
1 

7.9E-09 
3.3E-09 
1.2E-10 
3.0E-11 
2.0E-11 
8.7E-12 
1.1E-12 

1 
1 

7.3E-05 
4.1E-09 
9.1E-10 



Results for 33 (77%) varieties where both parents were ranked first and second are not included 
in this table (see Figures 1 and 2). 
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Table 4. Probability of ancestry for five individual 
from different numbers of loci (50, 100, ISO, 236). 



soybean varieties using SSR data obtained 



Inbred 


L50 

Possible ancestor 


L100 

JrTOD rossioie ancestor 


Prob 


L150 
Possible ancestor 


Prob 


L236 
Possible ancestor 


Prob 


P=0.5 
93BII 


XB31C 


0.9461 XB31C 


I 


XB3IC 


1 


XB3IC 


1 




A3415 


0.8006 A3415 


0.9362 


A34I5 


0.9146 


A34I5 


0.9954 




XB38A01 


0.0256 WILUAMS 


0.0429 


WILUAMS 


0.0809 


WILUAMS 


0.0046 




P927I 


0.0251 A3242 


0-0155 


YB30L0I 


0.0034 


A3242 


0 




YB3QLQ1 


0.0232 YB3QW 


0.0015 


A3242 


0.0006 


DOUGLAS 


0 


Aim 


COOK 


0.7748 BRAXTON 


0.9725 


BRAXTON 


1 


BRAXTON 


1 




XB63DO0 


0,2841 YOUNG 


0.5302 


YOUNG 


0.8910 


YOUNG 


0.9929 




S6262 


0.1826 COOK 


0.3872 


P9641 


0.0404 


XB63DO0 


0.0071 




YOUNG 


0.1755 XB63D00 


0.0496 


XB63D00 


0.0254 


96B32 


0 




BRAXTON 


0.1065 P964I 


0.0328 


COOK 


0.0245 


P964I 


0 


P9443 


DOUGLAS 


0.8086 A3415 


0.5557 


FAYETTE 


0.8760 


FAYETTE 


0.9885 




A34I5 


0.7629 FAYETTE 


0.4957 


A3415 


0.7034 


DOUGLAS 


0.8847 




WILLIAMS 


0.0887 DOUGLAS 


0.4855 


CX399 


0.1671 


A34I5 


0.0846 




YALE 


0.0501 


0,2032 


CX260C 


0.1273 


WILUAMS 


0.0348 




P9394 


0.0411 WILLIAMS 


0.1608 


WILUAMS 


0.0948 


CX399 


0.0062 


S38T8 


S3535 


0.8711 


0.9993 


S3S35 


1 


S3535 


1 




S4644 


0.4543 S4644 


0.9988 


S4644 


1 


S4644 


1 




YB44R01 


0.2762 YB40M0! 


t),0012 


YB37Y00 


0 


A4268 


0 




YB40M0I 


0.1087 


0.0004 


93B65 


0 


YB44R0I 


0 




YB44Q01 


0.0325 XB57y00 


0.0001 


44268 


0 


YB37Y0Q 


0 


YOUNG DAVIS 


0.6589 


0.6551 


DAVIS 


0.6324 


DAVIS 


0.9752 




XB63D0O 


0.4942 ESSEX 


0.5979 


P964I 


0.5524 


P964I 


0.5397 




96B32 


0.3122 PSW/ 


0.3409 


COOK 


03231 


ESSEX 


0.3273 
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COOK 


0.0707 CCl/A 


0.1692 


ESSEX 


0.2817 


96B32 


0,1299 




OGDEN 


0.0606 9033* 


0 1315 


96B32 


0.1933 


COOK 


0.0235 


p=0.99 
















mu 


XB31C 


1 X331L 


1 

I 


XB3IC 


1 


XB31C 


1 




A3415 


0.7777 AJ9*-J 


0.9999 


A3415 


1 


A341S 


1 




A3242 


0.0001 A3 24 2 


0 0001 


P9443 


0 


WILLIAMS 


0 




P9443 


0 rWw 


A 
V 


A3242 


0 


A3242 


0 




WILLIAMS 


0 WILUAMo 


u 


WILLIAMS 


0 


FAYETTE 


0 


Aim 


BRAXTON 


1 BRAAt UN 


1 


BRAXTON 


1 


BRAXTON 


1 




YOUNG 




0.9903 


YOUNG 


0.9987 


YOUNG 


I 




P9641 


0.0092 P9641 


U.UU7i 


96B32 


0.0012 


XB63DQ0 


0 




96832 


0.0005 96B32 


U.UUUJ 


P964I 


0.0002 


96B32 


0 




DAVIS 


0 D>4 K/j 


A 

u 


DAVIS 


0 


P9641 


0 


P9443 


DOUGLAS 


0.9998 DOuulAo 


A OQQQ 
V-7777 




0.9995 


DOUGLAS 


1 




FAYETTE 


0.7010 FAY but. 




DOUGLAS 


Q.9993 


FAYETTE 


1 




CX260C 


0.2345 CX260C 






0.0006 


CX260C 


0 




A3415 


0.0644 


V.uOHJ 




0.O0Q5 


CX399 


0 




S394I 


0.0001 AP333U 




P9394 


0.000 1 


A3415 


0 


S38T8 


$3535 


1 


1 
1 




1 


S3535 


1 




S4644 




1 


$4644 


I 


$4644 


1 




YB40M01 


Q ID4UMUI 


V 


93B67 


0 


A4268 


0 




YB44RQI 


0 ^5P7P 


0 




n 

V 


YB54J0O 


0 




93867 


0 YB44R01 


0 


YB37Y00 


0 


YB44R0I 


0 


YOUNG DAVIS 


1 WWS 


1 


DAVIS 


i 


DAVIS 


1 




ESSEX 


1 £SS£* 


1 


ESSEX 


l 


ESSEX 


1 




P9641 


0 FJW 


0 


COOK 


0 


$4240 


0 
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2 MAIZE - THE PLANT AND ITS PARTS 

R. Scott Poethig „ 
Department of Agronomy, Curtis Hall 
University of Missouri 
Columbia, MO 65211 



a ^™tc to an aoDreciation of plant morphology is , 
One of the greatest deterrents to an ^PP« v This prob lem is 

the terminology used to describe various ' ^f j^ el n]15ual struC ^re. 
compounded in the f« * ^ ^ ?odJ composed of stems, leaves 

We all learn that plants have a vegetative o y y d stamelis . 

and roots, and that J^^id? of lea^ w 0 Pl kin ds of stems, two 

Maize, however, has at ^\*$%^£J£to glumes, lemmas and paleas 
kinds of roots, and two ^^J^%l T f nn ^i v \ese parts are arranged 
take the place of sepals and petals . tortuna jr> ^ morpho i 0 gy as 
in a relatively simple fashion so the ^ 0 ^ £ soroe Q{ the 

not as difficult as it ^J^^p^'Sf describe their organisation, 
most important P^ts °f the ro f^% P ^ ^° ntal morphology of maize have 
More detailed descriptions of ^ t^ deveiopme & h (l949> reprinted 

been provided by a nurob^ mvest^awrs^ development. The 

1980) gives a good g^nejal pict vegetative and reproductive 
external morphology and the fcsta lo gy 01 tn S ^ ^ &nd Abb 
shoots have been studied by Bonnett U^. ^ £ } while the ro0St 

and co-workers (Abbe and Phamey , 1951, g * ^ of ; Ran dolph (1936) 
comprehensive descrxptions of tte^bryoajiy^ rf ^ plant , 

SU b, b "sL?ff ^has^e^rinted in the recent edition of Corn 
and Corn Improvement (1976). 

Theo^aniz^^ of ?hT pfant bo^y if Tif 

famUyT^HTGramineae, and as in all grasses m <£ * h ffiaize plant 

tissue (Fig. la). To appreciate f • ^^y^J^J. lb ). Stripped 
it is helpful, therefore, to see it in a leal less s v » a 
naked, the maise plant is not very «J>«JJJ^- ^SoTorTugarcalle- The 
Slender, segmented shaft smnlar to a stalk ° f J^y^ J f leaf at tach- 
enlarged joints along the s em *ewto,^ * ™ e |ternode. Each node 
ment; the stem segment between ' ^^^e n agh^Fg leaf , giving 

bears a single leaf in a position opposite ^J™,^ 1 ^. J i; 2). This 

2.35* * " appendases ' 

wherever they occur on the plant. 

Maise has unisexual r^ ^^J^ FtilETl 
flowers are boated at the apjca ^ P^^^^frs are found in"55e to 
branched inflorescence ■ of short branches near the middle 

several compact ears, located on the enas w »* 
of the stem (Fig. lb; 2). 

Tbis partition of male and tm* 
Aguish?, < 5*2£» 3^ JS. V«« 
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flower used as a female parent, an 
yields only one $eed. 



especially tedious job when each flower 




B 



& plant drawn without leaves anu h wh] le the 

^i* JS^ S St^)' The ear shoot 

Mai ,e also differs from closely relabel spec£ * ^t^SS 
few branches. Only the lower 10 to 12 d . Ab ove-ground 

branch primordia, and most of these remaui s^pp ^ s ^ 
primordia develop into ear shoots whUe m t ture t0 th e mam stem 

nodes develop into tUlers-branches ^enacai m d 
Co^nerclal fybri*W •J 1 ^^ contrast, some "varieaes" 
rSve^al UH^ndTaylroduce 2 ears on the ma. ste* - 
some ears on tillers. 

. western: During the first ^J^J^SS^^TZ 
point^BTlhe-stem lays down all the node ^ m ^ f ^ 
fhen **«*f 4 ^^%« Sfough the plant may be 3-4 feet in 
not more than 3-4 mcnes 
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height (Fig. 3). Bubse^Uy £ « ^^SS" StS^ 
most of the growth occurring at ttv e * however, and remain below 
6-8 internodes do not parkcipa ta in ^rowtn These BubterranMft 

ground where they produce * e ™**^™ 0 tl. stem forming a distinctive 
Lernodes taper sharply towards the a fe* inch* above 

region, the crown (Fig. lb). The stem is internodes from 

ground, and tapers gradually ' *™*» ^Je^ociated with the axulary bud 

and are smoothly cylindrical. 



m»lc floret 




anther? 



Figure 2 The major parts of the maize plant. Drawings ; in part from P. 
assembled" by M. M. Johri and E. H. Coe. 
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The stem of an ear shoot called ^f In 2 «^ 
main stem in being ^ ive ^^^er SSJSir in shape and size, and 
nUs of the shank J« ^"SEott surface. Secondary ear .shoots 

tend to have a ^d rath er *an s mo ^ but ^ m most 

^^WSiS- of J? apical ear is prevented. 




Figure 3. 



A ^ wee, old 5? S 
SStX^S if-rr« short at this stage. 



Thetassel: The tassel , at *J Sf 

series" of large branches spAej) covered ^ & 

bearing branches (spjkelete: Fig. ^ ^cn * on a short stem 

?wo spikelets, one on a long ™<S£$g$&^ \ urn , produce s two func- 
(sessUe) (Fig. 4a). Each of ^^f^in'bcth st^Jks and a pistil the 
ioOTorets. Although tasse ^J^^lSttoed. making the floret 
pistil normally degenerates soon i after^ it is bage of the Uss ^ 

brSe^nvirore^^ - d « ^ 

common on tillers. 
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it s-h-natie drawtae of a pair of tassel spfltelets (A) and a pair of 
Figure 4. f°hematK < turnout « ^ P ^ tower ta u, e e?r sputelet 

Sor * d^»»t. p.s - pedicellate sptofet; juj. - 
sessfle spfltelet; gl - glumes; le - lemma; pa - palea; fl floret. 

other located between the two florets <*« Egg> < ' * cal 6tructure£ 

SKSVEK'. ^Cf^opeS a. Lir Up. 

.at St ^^TS^llSSS 

^ra'cysssra^ . *» - sp.™ cea*. 

The ear- The ear is morphology similar to the tassel, although this 
resem^icfis M b, 3rferences h , t£«* Uve «. 
The crucial difference between them is, of course, mat m 
male flowers, and the ear bears female «*>>-™»Jf ^rdifSri 
to the fact ttat during tt. «^ £££j*&FS. p£tU develops 
arrested at an early stage in w eu i «» * 0 varv, which terminates m an 
fully. Each functional ear floret 5?a7y is a single embryo sac , 

elongated style, or nft (Fig. 5). J* 1 ™" J 1 \* loid ceUs * resulting from 
The emhryo sac is the product of one of the four napi dg _ 
the meiosis of the megaspore mother cell White ns cnr 
generate, the nucleus of tim ceU £T ?*J ot these nuclei 

nuclei within a «^ T/Z^o 'J where they become 

^^tarsAS: saris* ^ * sac 
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antipodal cells . 




Figure 5. 



Radial longitudinal section of an ovary with an unfertilized 
embryo sac (after Randolph, 1936). Upon f ertihsation the 
nuceUus is digested by the expanding embryo sac and the tissue 
surrounding the nuceUus is transformed into the pericarp, si - 
silk; e.s. - embryo sac; nu - nucellus; in - integuments. 

The ear also differs from the tassel in that it has no major lateral 
branches Its thick, Kgnified axis, the cob, is homologous to the central 
S of the tassel. As in the tassel, ear spikelets come in pairs but m 
theea? they are equal in size and only one of the florets in each spikelet is 
func?£ial (Fig 4b). An ear therefore has an even number of parcel rows 
S^SSy SL kernels equal to the number of spikelets on the cob. The 
number of rows (or ranks) of kernels ranges from 4 to 30. 

The glumes, lemmas and paleas of the ear spikelets are readily visible 
in an ^fertilized ear, but are soon obscured by the enlargement of the 
SvaS? Serialization. In a mature ear these structures are represented 
bTtL chif ^Tadheres to the cob and the base of the kernel after it as 
shelled. 

The leaf: Maize produces three kinds of vegetative leaves: Mag 
leaver ^ leaves an! prophyjls. A foliar leaf is ^locate d Jjeacfccf the 
HodeT o n the main stem, husk leaves are located on the shank of the ear 
shoot, and prophylls are found at the base of the shank between the ear 
shoot and the stem (Fig. 2). 
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The foliar leaf has two ^ ^fS^Sm^^S^S 
tending away from the > stem *• ^jj 1 Kf£ P £bX? of a spongy 
around the stem (Fig. 2). ^^^i ™ a llel longitudinal veins. This 
network of cells traversed by 'a series of P a ^g ' k ^ e d translucent struc- 

f^c ^ iSffffe^ -re rigid 

midrib. The sheath completely ^orcles ™^™> of that demode., 
which it is attached and may extend [ the ^ * n g£ aths provide most of 
During the early development of the P At the bound- 

the mtchanical support necessary to keep the ste* translucent 
ary between the Made an d the ^^^^tLTl^ sharply, 
tissue, In this region, the leaf biaoe ana i translucent tissue 

forming an indentation m the leaf ™rgin. T **!£ a ge T £ e is ^ thin 

adjacent to this indentation is known as the Tne *£H£ 

collar of filmy tissue located on the inside of the hinge. 

upper ones are arranged distichously . 

Located between an ear shoot and the stem, the P^P^JS^-ffiST) 

internode, leaf and axillary bud. 

The roof More is known about the growth, cell biology, P*?**}?** 
and a^mFof the primary maise root, or radkle, ^ JL^Sf * fS?6?^Id 
o?gan of the plant. Its histological structure, described by Sass UJW** 
rr- ik^^k mgaca -iq rvnieal of roots in general. The apex oi tne root i* 

rThtir ZSZIT™ .division is resmcte^o the apical 3 mm of the 

root, and occurs at a ^f'f^f™ *thT o? elongation being 
elongation extends 8 mm behind _ the ^ x > Those frterested in 

ErSkson (1979; 1980) and Green ^ s ^u£L S 

meters that must be taken into consideration in such studies. 

n.. nHmarv root represents the basal end of the plant axis, which In 

prtnordia of a few rfvenUUous roots are 1 ^*i 0 f™SSrS. « ^ 

^iTSSJS'^U.. a-d *o appear 
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SeTabove ground are known as bracjLrpots, 

« u^^taiiv for several feet before turning 
Adventitious roots ^ °TsvlSL o >* singk plant often covers a 
downwards. As a result the i root system « r » S v be as 

region 6-8 feet in ^^^iJt^^ ^Jly in the region 
much as 6 feet. As it grows we s rwi unicellular root hairs. 

" w been estinatea K be 6 

miles. 

s&^^jsjss^jf^ssrs^ ^ ph_ is stm 

unavailable . 

The si* is receptive to ^ 
after a pollen grain lands on a sdk it sen ds out proces s the 

silk and grows downward ^^^^J^le tip of the pollen 
vegetative nucleus and the two ^f™ ™ JJJ Upon reaC hing the embryo 
tube where they remain ^^..^ ^ end of the pollen tube bursts, 
sac, 12 to 24 hours after ge j™£*>^ ^ She two polar nuclei 
releasing the two sperm. One s P.^ rt ^ c r 1 £ i !JL 1 cel i that gives rise to the 
in the Center of the embryo sac ^ a % g nU Sus to form the 

endosperm . ^^J^^^^^aZ JS the egg nucleus are 
2 ygote. As often as 2% of the ™ e J5JL l ' m j ns with the extra sperm nuclei 
fertilized by sperm from ^^.PgJ 1 *gg>' This phenomenon, called 
being somehow lost (Sarkar and ^^^^twSeB the genotype of 

in detail by Randolph (1936) . We ^^/ fold ^crease in the volume of the 
40-50 days and is b £ a an" the accumulation of food 

embryo sac. The growth of the « mD J^° h t d v 40 and the remaining 
reserves in the endosperm is completed by about aay «j, an 
10-20 days is spent maturing and drying. 

a ^ »»- b- «*- -£ g£ £Z^FSi£%& *2 

«bntt (lit- •)• , The SSi^f is tt««fore genetically idaticsl 
^ iLX^rX:' £ e»br y o present th, ne* 

generation. 

The endosperm makes up about g J*£S£. "xVis 

the food source for the -bryo to r sever al days after ■* g ^ ^ 

food takes the form of mtracell^r starch) grams ana p ^ endogpem 
concentrated to varying degrees m_ 'centration of starch and 
(Duvick, 1961). In flint-type kernels the J^™,, than ta the 
protein bodies is ^higher aroimd ^ P^^Sernal layer, and a soft, 
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characteristic of specific races of maize. Other common endosperm traits, 
such as Sugary, floury or shrunken, are single gene mutations and can exist 
in either a flint or dent background. 




Fiffure 6. Longitudinal sectors of a mature dent kernel, taken perpen- 
dicular (left) and parallel (right) to the upper face of the kernel 
(after Kiesselbach, 1949). pe - pericarp; en - endosperm; al - 
aleurone; sc - scutellum; co - coleoptile; pi - plumule; ra - 
radicle; cr - coleorhi2a. 

Much of our understanding of gene action in maize is based on the 
analysis of genes affecting the pigmentation of the external layer of the 
endosperm, the aleurone . This specialized single eell layer is the only part 
of the endosperm capable of becoming intensely pigmented. Internal endo- 
sperm cells may be either yellow or white. 

The embryo is located on the broad side of the kernel facing the upper 
end of the ear, beneath a thin layer of endosperm cells. Most of the tissue 
xn the embryo is part of the scutellum, a spade-like structure concerned 
with digesting and transmitting to the germinating seedling the nutrients 
stored in the endosperm. The shoot and root axis are recessed m the outer 
face of the scutellum. In a mature kernel, the shoot (plumule) has 5 to 6 
leaf primordia that are arrested at successive stages of development (Abbe 
and Stein, 1954). Surrounding the shoot is a cylindrical structure called 
the coleoptile. Upon germination, the coleoptile elongates until it is above 
ffround and" 'is then ruptured by the more rapid expansion of the rolled 
leaves within it. The root is enclosed in a sheath of tissue called the 
coleorhiza. Unlike the coleoptile, the coleorhiza does not elongate very 
much, and gives way to the radicle as soon as it emerges from the seed. 
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ABSTRACT 

• r j tn The studv of biolosty and to applications such as the 

Dtttmittiulon of pamage ■ Fundamental W *e «*> ol mo w inlufficiem 

mining and miitypcd da* to demonic the utility and fle»t»l>qr of ih,i algorithm. 



DETERMINATION of parentage is fundamental to 
the study of reproductive and behavioral biology. 
The increasing availability of highly discriminant ge- 
netic markers for many diverse species provides the 
potential to uniquely characterize individuals at numer- 
ous loci and to unambiguously resolve parenrage whci e 
genealogical relationships are unknown, in error, or in 
dispute. 

Identification of parent-progeny relationships in wild 
populations of animals and plant* provides insights into 
ihe success of various reproductive strategics (ElL- 
strand 1984; Smouse and Meagher 1994; Aldersom 
et ai 1999) and has allowed for the implementation 
of management programs to conserve genetic diversity 
(Miller 1975; Rannah and Mountain 199"). The 
association of pedigree with physical appearance or per- 
formance in domesticated animals and plants allows 
parents that have contributed favorable alleles for desir- 
able traits through selective breeding programs to be 
identified (Bqweks and Meredith 199?; Sefc et ai 
1998; Vankan and Faddy 1999). These applications of 
associative genetics facilitate further progress in genetic 
improvement through breeding. Establishment of par- 
entage is also useful to secure legal rights of guardian- 
ship in humans, to help protect intellectual property in 
plant varieties, to validate breed pedigrees of domesti- 
cated animals, to protect stocks of fish, and to identify 
provenance of meat ih-.u is available in supermarkets 
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(Got* and Thaller 1998; Primmer et ai, 2000; White 
ctaL20W). 

Most studies of pedigree have utilized exclusion analy- 
sis where the molecular marker genotypes of either one 
or a restricted number of potential triplets of offspring 
and putative parents are compared. Ofien the identity 
of the "mother is not in question; the maternal profile 
is subtracted from thai of the offspring and the deduced 
paternal profile is then compared with candidate father 
genotypes (Ellstrand 1984; HAMuiCK-and SchnabEL 
1985). Individuals who could not have contributed the 
paternal genotype are excluded; the remainder arc pos- 
sible parents. Nonpaternity in humans b generally de- 
clared only on the basis of exclusions exhibited by at 
least two unlinked and independent loci. This criterion 
of exclusion reduces the likelihood of a false declaration 
of nonpaternity on the basis of marker results that are 
actually due to mutation within the phylogeny. Bein et 
al. (1998) show that evidence of nonpaternity should 
require exclusions at loci on different chromosomes to 
avoid erroneous conclusions that would be made due 
to nondisjunction at meiosis leading to uniparental in- 
heritance. A requirement for at least three independent 
exclusions to declare nonpaternity in humans has also 
been instituted (Guns et ai 1997). In studies of natural 
populations of animals or plants where numerous par- 
ent-progeny triplets are examined it is usual to accept 
a single exclusionary event as evidence of nonpaternity 
(Marshall et <d> 1998). Paternity toting has been ex- 
tended to situations where DNA from cither parent is 
unavailable. For example, paternity can still be estab- 
lished in circumstances where the putative father is de- 
cc;*ed tautjnMgrcni s are still uliyeJ^^is^J^^ 
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dL . m oastrate that paU-mifc C ;m l* d^nnimrd in cxse> 
where the muthcr i* unbailable fur testing Uso d 
p-u-ciallv rc f .<>n<miat'd du- DNA profile ol a 
missing crocodile p.ucm using proftk* of the mmher 
and progeiw. 

CHAKRABiik-iT «/. * 19$*J wk! Smouse and MkaChkk 
(l<iu-t) report tfut reliance upon exclusion alone hu> 
usuallv failed to unambiguously wolve paternity. Lmu- 
cadonif have stemmed from die use of an insufficient 
number of independent hypcrwiahle loci. Other sutu- 
ticat methods are therefore required to calculate the 
likelihood of paternity for each noticNduded male 
(Berry and GstsssR 1936; M&tGKtR 1986; Meacher 
and Thompsom 1986; Thompson and Maacher 19ST; 
Devun ttal. 1988; BERRY 1991). Marshall etal. (1993) 
draw attention to the quality of data that is encountered 
prtCticalW in genotypie surveys. Maternal genetic data 
may or may not be available, data may be absent for 
some candidate mates, data may be missing for some 
loci i'n some individuals, null alleles exist, >nd typing 
errors occur. Reconstructing or validating the pedigrees 
of varieties of cultivated plants often provides additional 
challenges because their phytogenies can reveal appar- 
ent exclusions that masquerade as non-Mendeltan in- 
heritance. For example, apparent exclusions can occur 
in circumstances where an individual is used as a parent 
prior to completion of the inbreeding process. The de- 
velopment of parent and progeny then continue on 
parallel but separate tracks thereby allowing the possibil- 
ity that alleles that are subsequently lost through in- 
breeding in the parent can still become fixed In the 
progeny. It is also possible to create many offspring from 
a single mating and to use the same parent repeatedly 
in "backcrossing" Therefore, many individual inbred 
lines, varieties, or hybrids can be highly related, f n con- 
sequence, there are numerous (and often very similar) 
pedigrees. The effective number of marker loci that can 
discriminate between alternate pedigrees is proportion- 
ally reduced as parents are increasingly related. Conse- 
quendy. inbred lines can be more similar to one or 
more sister or other inbred* than those inbreds are to 
one or both of their parent*. 

U has not been usual to search among hundreds of 
individuals to identify the most probable maternal and 
paternal candidates for a specific progeny. Most studies 
of parentage are in circumstances where there is a prion 
information for at least one of the parents (usually the 
maternal parent) . Limited availability of marker loci and 
the lack of verv hi^h-throughput genocyping systems 
offering inexpensive dwapoim costs may have focused 
research on studies that involve relatively few individuals 
and where there is at lea* M#mc « pnuri indication of 
'parentage. Studies that have Ix-ini couducrcd without n 
brinri information on parental mrhufc specie* where 
reproductive Iwiuiviur render iiU'iiulicaiiim of the ma* 
u-nial parent difficult or impossible. Examples include 



thMw undertaken on bird* dial puu liu- hmod paradi- 
gm (Ai.ukrsu* at ai UW) <» csuu-pir copulation 
(WVlTON it aL or on species sudt ;l> the wombat 
tliar arc ditTuiult to observe in \\\<: wild 'T.vvr.oK el ni 

i yy ~ } . 

Two circumstances favor a revised approach to the 
statistical analysis of pedigree. First, mulecukir marker 
technologies are rapidly developing and will Allow nu- 
merous loci to be typed for thousands of individuals 
rapidlv and inexpensively. A greater number and diver- 
sity oflarger-scale Studies of pedigree can be expected 
within the plane and animal kingdoms including individ- 
uals in which there is no prior knowledge of pedigree, 
A larger number of markers mean a greater chance 
for errors. Therefore, the second circumstance follows: 
"Procedures that are efficient and robust in the Eace of 
apparent exclusions, missing data, and laboratory error 
arc required. 

The purpose of this article is to describe and evaluate 
a methodology thai can be used to quantify the probabil- 
ity of parentage of hybrid genotypes. We focus on par- 
entage because it is the primary focus of published litera- 
ture and it is the easiest level of ancestry to understand. 
The method is robust in the face of mutation, pseudo- 
nonAlendelian inheritance (apparent exclusions) due 
to residual heterozygosity in parental seed sources, miss- 
ing data, and laboratory error. The methodology has a 
number of advantages: (i) It can accommodate large 
datasets of possible ancestors (hundreds of inbreds or 
hybrids each profited by >i00 marker loci), (ii) it does 
not require prior knowledge about either parent of the 
hvhrid of interest, (iii) it does not require independence 
of the markers, and (tv) it can successfully discriminate 
between many highly related and genetically similar ge- 
notypes. We demonstrate the effectiveness of this ap- 
proach to identify inbred parents of maize (Zra mays 
L.) hvbrids using simple sequence repeat (SSR-) marker 
profiles for 54 maize hybrids together with their parental 
and grandparent*! genotypes included among a total 
Of 386 inbred lines. The methodology is applicable to 
the investigation of parentage for all progeny developed 
from parental mating without subsequent generations 
of inbreeding. 

MATERIALS AND METHODS 

Algorithm: Consider an index hybrid whose parentage is 
unknown of m dispute, Inbreds in an available database are 
possible ancestors of the hvhrkl. Tiie objtraWc is to find the 
probabilities of closest ancestry for each inbred Oil the basts 
of information from SSRa from the index hybrid and the 
inhrcrt*. There is no reason CO trim the d:ir;il\vr by removing 
inbreds thought to be unrelated tu rhe units hybrid because 
their lack of relationship will be discovered. 

Consider a pair of possible ana-sron- mhred / and inbred 
j. There is nothing special about ihi> panic utor pair as all 
|..iii-N ^ill fronted simihrlv. TV prm-i-** invokes ralrulaimg 
[b«. ^n.bubilicv that hihrcd* / and ./aiv in ihr hvhrid"* ancestry. 
iepe;imu: this for ail pairs of inbred), in rhr dinabxu-. 
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ruiUiltili'. •»! Viii 

The husis ol ihv jljfriilim i» EUcV rule {*.<;. d>kky IVVl. 
With U-i H ilSSfci mjmiI tof nV (pixtcnni > pi.ih.ihililv 
thut'iuiCl ;»vc ancesmr* ..f the index hvbrid given flu- mi^uM- 
tiort from [he ^..its .W*. Let FT i, /> stand fof uV unmnr 
liwtul ('jr uru»n probable of the wmc wni. FmalK. 
«SSR»li. /> w thr probability i,f observing the various W>K 
results if in fact / n.ul / arc anchors. Bayes' rule say* 
rU/l5SR»i - fiSSMi. }> < PI'. /> el x P»... n]. 

where the sum in the denominator is over all pairs of* inbreds. 
indexed bv u and * flSSfci*./) * fUy> & of the term* 
in the denominator. {To compute the denominator w the 
above expression ns a particular order to the inbreds m the 
database and cake n < vxn expression* involving the pair (u, 
v) If there are 5S6 inbreds. for example, then chc number 
of pairs and the number of terms in the denominator is 
3 g6(537)/2 = 171.991.) Inbreds * and; may be parents or 
mndparents or other tvpes of relations or bear no relation- 
ship at all to the hvbricLlf there arc more than wq ancestors 
in the database, such as both parents and all four grandpar- 
ents, then the possible pairs involving these ancestors will 
n-nerallv have the highest posterior probabilities If the hv- 
brid*> true parents arc m th* database, then as a pair they will 
typicallv have the highest overall posterior probability. If both 
/and /happen to be related co one particular parent of the 
hybrid, then as a pair their posterior probability will be low 
because they will not usually account for many of the alleles 
that are contributed by the other parent of the hybrid. 

We will make the Wprior-inFormltion' assumption that 
F{u. V) is the same for all pairs (k, u). This implies that this 
fector is cancelled from both numerator and denominator in 
the above expression, giving 

P[i.fiSSfts) = PtSSRsli,;)/S^S5Rsl tt . v). 

The problem is then to calculate atypical fXSSRsltJ). Assume 
inbreds /and ; are both ancestors. We calculate the probability 
of observing the resulting hybrid under this assumption. We 
make no assumptions about relationships among the various 
inbrtfds. Other possible ancestors ivill be considered implicit^ 
in the calculation by allowing their alleles to be introduced 
through breedings with I and /. However, the nature of such 
breedings is not specified. Suppose Inbred fs alkies are (a. 
b). Each descendant of inbred l receives one of these two 
alleles or not. An immediate descendant receives one with 
probability I (barring mutations). A second generation de- 
scendant receives one of them with probability 0,3, And *o 
on. Since degree of ancestry- (if any) is unknown, we label the 
actual probability of passing on one of these alleles to be P. 
Similarly, an allele from inbred j has been passed down to the 
hybrid or not, and the probability of the former is ft In the 
following, Pwill be taken to equal 0-30. although we will also 
consider P = 0.D9 in some of the calculations. 

Assuming P = 0.50 is consistent with the closest ancestors 
in the database being grandparents. However, we are not 
interested in grandparents p*r sr. If the closest ancestors In 
the database were parents, then as indicated above P should 
equal I {ignoring mutations and laboratory errors). Our pri- 
mary concern is when the parents are not in the database, in 
this case Pis no greater than 0.30. .Assuming P - 0.50 is robust 
over the middle range of possible values of P. One way m 
which It is robusr is ifrhcre may be mutations and laboratory 
errors, in which case P would have to he <:i. Taking P to 
equal 0.50 levies little peiUUV against a particular pair in which 
there is an apparent exclusion fmm direct parentage. There- 
fore taking P to be <l means that if the truer parcm* J re »» 
the daub.tse then cho will u»c Ho ruled out inhere happen 
to be mutatumsnud ialwattuv errors. And if the rlnsiM anrr*- 
tors in the dmubas*' are mote remote than ;fl;inrl|>.irvuu. thev 



sr. 



are likrK to kU-mihVd b«vaiiM* i\wr> uill umuiIK havt* rlu- 
It west niiMnatchcS of tin-; Km"* rineudrred. 

When j and ; are ancestor* thert: are tour possibilities: { 1 > 
The alleles of both inbred* rand / were passed to the h\brid. 
rd) inbred i came through bin tun inbred /. (fl) inbred ; 
came Through but not inbred t. and \ -\} neither inbred camv 
through, .\ssuming independence, these have respective ppjl)- . 
abilities Ptl - P). PU - rn. {i - h\ tn the ca*e P - 
0.50. all of thciC probahiltties equal ').US. 

Aii instance of die law of total probability (Sec. 3.3. BtRKV 
WW) is that the probability of ob>er%ing a hybrids alleles is 
the average of chc conditional probability of this event given 
the above four cases. The simplest of the four cases is the 
first possibility: Assuming the hybrid's alleles are passed down 
directlv from both inbreds, the probability of observing the 
hybrid's- genocvpe is cither 1 or 0 depending on whether the 
hvbrid shares both inbreds* alleles. (It is especially easy when 
both inbreds are homozygous.) The other three cases require 
an assumption regarding the possibility that an inbred s allele 
is not passed to the hybrid but is interrupted by a mutation, 
a laboratory error, pr'intervening breeding We regard such 

an allele as feeing selected from all know* alleles with probabil- 
ity 1/ (number oFiilkk*). where t.h* numher of alleles is the 
total number of alleles known to exist at the locus in question- 
Art alternative approach would be to use die allelic propor- 
tions that are present In the database (or in another database). 
However, the lines in the database may not be randomly se- 
lected from any population. For example, a line that has heen 
hiifhlv used in'breeding would have many derivative lines in 
the database, in which case the frequencies of its alleles will 
he artificially inflated. Assuming equal probabilities for the 
various alleles at a given locus is robust in the sense that it is 
not affected by adding and dropping lines from the database 
There are many cases co consider when computing the 
probability of observing a hybrid's alleles, depending on the 
zygosity of the hybrid and the inbreds. and allowing for the 
possibility of missing alleles or "extra alleles" in the assessment 
of the hybrid and inbred genotypes. These possibilities are 
too numerous to list. Instead wc give three simple examples. 
All the examples have homozygous inbreds, the most common 
case. And each of the three hybrids has two alleles, again the 
most common case. We suppose chat the measured alleles for 
three SSR* and a particular trio of hybrid and ancestor inbreds 
are as we have indicated In Tabic I. 

For SSR 1 there are three known alleles, one in addition 
to alleles a and b that arc listed for the three lines (hvbrid. 
inbred #, and inbred j) in Table I. for SSR 2 and SSR 3 
there are two known alleles in addition to those listed. The 
calculations in the right half of Table 1 will now be explained. 
Implicit in calculating P^SSR: /, j) is the assumption— required 
in both the numerator and denominator of Bayes* rule— that 
inbreds ( and j are ancestors of the hybrid. Consider SSR I. 
In case i above, both ancestors' alleles (as measured by the 
laboratorv process) are assumed to pass to the Index hybrid 
and so in this case the hybrid is necessarily ab. The probability 
of observing the actual hybrid's genocype is 1 for case I. as 
shown in Table 1- In case * we assume that inbred is allele 
passes to the hvbrid but inbred ;'s does not. Indeed, die hybrid 
has an * allele*. The probability of observing a tt as the other 
allele is 1/ (number of alleles) = 1/3. as shown in Tabic 1. 
Case 3 is similar. In ca*« 4, neither ancestor allele is passed 
to the hybrid; the probability of observing the hvbrid'* ^eno- 
tvpc yor anv heterozygous genotype) is 2t I, 3)il- 31 ' 2. 0- 
Since P= 0*50. the overall i unconditional) probability in the 
rightmost column (17; 36i is the simple iWerogr of the four 
case*, as itirli rated in Table I. 

For SSR 'i and SSR 3 (he calculation are *imiUu\ For S.SR 
1> there is simie evidence aipuiwt pair (/. j) being ancestor*. 
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Probability of otocrving a hybrid's alleys using three sample SSRs and four powiblt combinations {cases) 
of alleles passed, assuming that inbreds i and; are ancestor* of the hybrid 













Pi'ubabiliry of Observing the 
hybrid** geuotypr* 


Ovu-.ill 
probability 


No. of 

SSR alleles 


Hvhriri 


Inbred i 


htbrcd / 


w 


Case * Case :> 
j' t not j not i, j 


Case I 
rwl i, mU j 


1 3 

2 5 

3 6 


ab 
W 

ab 


an 
bb 

Cf 


Bh 
0 


1 

0 
0 


1/3 1/3 
1/5 0 
0 0 


2/9 
2 36 


17/36 
If 100 
2/14-1 



SSR, simple sequence repeat marker profilr. 



but it is not conclusive. For SSR 3 there is even less evidence 
favoring pair [uj)- It would not take many SSRs with evidence 
similar w> that for SSR 3 to essentially rule out this pair— 
provided that otter pairs are not similarly inconsistent. 

To find the overall ftSSRsIf, multiply the individual 
F[SSRI/,^ ° vcr thc ^ rious SSk*' There nr< ^ P urc, y compiua- 
tionnt is&ueS CO address. Each /^SSRl/j) is a number between 
0 and 1. When there are a great many SSRs, thc product of 
chese numbers will be vanishingly small To lessen problems 
with computational underflow, for each SSR we multiply 
PtSSRIu, v) by the same constant for each pair £«, v): the 
inverse of the largest possible such probability. For example, 
since 17/36 is the largest probabili^' for a heqerpzyjous hybrid 
at an SSR having three alletes (as is the case for SSR I in 
Table I), we multiply all factors P(SSRflu. v) by 36/1" To 
eliminate remaining problems with underflow, we do calcula- 
tions using logarithms (adding instead of multiplying) and 
take analogs at the end- 

The probability P(SSRI u, v) is calculated for all ( u, v) pairs 
and summed over all possible pairings in the database, includ- 
ing Chat for the inbred pair under consideration: (i. j). This 
gives the denominator in thc expression for P{i, ylSSRs). 

To determine thc probability' that any particular inbred, say 
inbred i t is the closest ancestor of the index hybrid, sum 
flSSRI/. v) over all inbreds f with v */ L Call this FfilSSfo). 
The maximum of P(ilSSRs) for any Inbred f is I, But since 
there Is one closest ancestor on each side of the family, the 
sum of P(ilSSRs> over all inbrcds i is 2. If there is a particular 
pair U fi for which PU jiSSRs) is close to I then both P(i1S$&>) 
and PfyiSSRs) separately will be close to L 

SSR dafcu DNA was extracted from 54 maite hybrids and 
from 586 maiie inbreds. All of thc hybrids and most inbred* 
are proprietary products of Pioneer Hi-Bred International: 
some important publicly bred inbred lines were also included. 
The inbred parents mid grandparents of each hybrid were 
included within the set of inbreds. Other inbred* chat were 
genoryped include many that are highly related by pedigree 
to parents and grandparents of thc hybrids, The hybrids were 
chosen because each has a pedigree that is known to us and 
collectively they represent a broad array of diversity of m.iize 
gerrnplasm that is currently grown In the United States ranging 
from eaiiv to late maturity. 

A tonlof 195 SSR loci were used in thU study following 
procedures describfd in Smith ft <iL { 1997^ but modified ;v< 
described below. loci were chosen on the basis that they 
individually have been shown to have ;i high power of discrimi- 
nation among m.iiye inhrcrl lines and collectively they provide 
fur a sampling of ili\vr*itv lor earh chromosome ami. Of rhest' 
SSRUiCi. the folk »wins; mimberh ^tn paunithi'icti wei r In ;nfd 
on individual maize rhn*iini*ome« as follow*: 1 ^35), *J i.tidt. 

:t w*\ t i i*joi. ;» [iu.. * \\ } ). 7 iok * ms», d t \ x >\. ;m<i m 



( U) i 1 7 SSR loci have not yet been mapped. The correlations 
among the loci are unknown and are irrelevant for our meth- 
odology. 

Sequence data for primers that allow many of these (and 
other) SSR loci to be assayed arc available at website http: '•' 
ww.agron.mijisouri.edu. All primers were designed to anneal 
and amplify under a single set of conditions for PCR in 10«^l 
reactions. Genomic DNA {10 n^;) was amplified in 1.5 rim 
MgCU. 50 mu KG, 10 nut Tris-Ci (pH 3.3) . using 0.3 units 
AmpliTaq Cold DNA polymerase (PE Corporation) oligonu- 
cleotide primer pairs {one primer qf each pair was fluores- 
eemly labeled) at 0.17 hm and 0.2 aim dNTPs- This mixture 
was incubated at 95* for 10 min (hot start): amptined using 
45 cycles of denaturadon at 95' for 50 sec, anneating ac 50° 
for 50 sec, extension 72 a for 35 sec: and then terminated 
at 72° for 10 min, A water bath therrnocyclcr manufactured 
at Pioneer Hi-Bred International was used for PCR reactions. 
PCR products were prepared for electrophoresis by diluting 
3 u,l of each product to a total of 27 u.1 using a combination 
of PCR products generated from other loci for that same 
maize genotvpe (multiplexing) and/or dH.20. Dilution of 1.5 
jjl! of this mixture to 5 u.1 with gel loading dye was performed: 
it was then elect rophoresed at 1700 V for 1.5 hr on an ABI 
model 377 automated DNA sequencer equipped with GENE- 
SCAN software v. 3.0 (PE'Apptied fciosystems, Foster City, CA). 

PCR products were sized automarically using the "local 
Southern" siting algorithm (Eldcr and Southern* 1937). 
After suing of PCR products uiing CeneScan, alleles were 
assigned using Genocyper software (PE-Applied Biosystcm*) . 
Generally, allele assignations for each locus were made on 
the basis of histogram plots consisting of 0-5-bp bins. Breaks 
be&vcen the histogram plots of >I bp were generally consid- 
ered to consututc separation between allele bins; however, 
other criteria, such as the presence of the nontemplate* 
directed addition of adenine (-i-A addition) and naturally 
occurring 1-bp alleles, were used on a markcr-by-markcr basis 
to define thc alle!e dictionary. All allele scores were made 
without knowing the identities of the maize genotypes. 



RESULTS 

Table 2 presents the probability of closest ancestry of 
the top five ranking inbred lines for each of 3 hybrids 
at P= 0.30 [Table ±\) :md P <* 0.D9 (Table 2B1. Proba- 
bilities of ancestry are shuwn for all 54 hybrids and the 
tt>p ranking mhrt;tls in Figure 1: P 83 O.jO (Figure U) 
and P = 0.99 (FiK»rr Mil. Results far the hybrid* pre- 
sented in Tahh; - uiv ti'.aurvtl it the top of Figure 1. 
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TABLE 2 

Probability of ancestry of five hybrids using data obtained from 50, l00 r and 195 SSR loci 
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•30 luct 



ion Wei 



M.ij lud 



Prob. 



InSiI. 



SE 



Inbd. 



Pnjb. 



SE 



34 IT 



3556 



3905 



3940 



3417 



3525 



3556 



3905 



39-40 



SPl 


0.9607 


0.0123 


PI 


P2 


0.S077 


0.1963 


?H 


D2P2 


o.ioui 


0.1 OSS 


DIF'J 


DIP- 


A. aoj'i" 


A A£VO - 

v.uv- ( 


5r l 


PI 


a a«jo 
0,033 


A A t n - 


uin 


Pi 


0.3543 


E-07 


PI 


P2 


0.3 1 33 


E-07 


P2 


DIPS 


0.1699 


E-07 


DIP2 


CP1 


0.1441 


E-07 


CPI 


GPS 


o.ono 


E4JS 


SPl 


PI 


i.oooo 


E-06 


Pi 


P2 


0.9616 


E-OS 


P2 


D1P2 


0.0340 


E-10 


DIP2 


GP2 


A AAI 1 

0.U043 


E AO 




D2P2 


0.0002 


E-10 


D3P2 


DlPi 


0.9322 


E-08 


D1PI 




0.4927 


E-o: 


SP2 


D2P2 


0.2S36 


E-07 


DIP2 


D IP2 




t?_AT 


U- r — 


P2 


0.0565 


E4)7 


Pi 


P5 


0.9997 


0.000 1 


P2 




0.9203 


0.0009 


PI 


PI 


0.0643 


E-05 


DIP2 


Dlrl 


A A I O™ 


IT A 2. 


D^P 1,1 * 


DP1P2 


0,0014 


0.O009 


DPIP2 


SPl 


0,9995 


0-000 1 


Pi 


P2 


0.8836 


0.1653 


P2 


DIP2 


0.0722 


0.1029 


D1P2 
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0,044 i 


0.062S 


DiPi 


PI 
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o!oooi 
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PI 
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Pi 


P2 
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0 


DIP2 


D 1 PC 
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E-ll 




GP1 


E-0? 


0 


D2P2 


GP2 


E-06 


E-17 


SPl 


PI 


1.0000 


0 


PI 


P2 


0.9996 


0 


P2 


DIP2 


0.0003 


0 


DIP2 


DIP! 


£-11 


0 


D3PI 


D2P1 


E-13 


0 


D2PI 


DIP! 


0-9999 


0 


DIPI 


P2 


0.9992 


0 


P2 


SP2 


0.0006 


0 


DIP2 


D1P2 


E-05 


0 


SP2 


D2P2 


E-06 


0 


D2P2 


P2 


0,9999 


E4)8 


P2 


DIP2 


0.9999 




PI 


Pi 


E-06 


E-13 


D1P2 


DIP! 


E-0S 


E-15 


D2P2 


DPIP2 


E-12 
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A. AiUtmiir^ } y a 
0.JJ7W 
O.SMl 
0.1H39 
0.1213 
0.0009 

0.9999 
0.3437 
0.4563 
E-07 
£J)7 

0.9999 
0.9997 
0.0003 
£-05 
E-06 

0.9803 
0.6230 
0.2321 
0.1317 
0.0197 

0.9999 
0.9970 
0.0030 
0.000 1 
0.0001 

B. Assuming P -■ 
0.9999 
0.9933 
0.0061 
£-05 
E-05 

0.9999 
0.97 fc9 
0.025 
E-20 
E-24 

i.0000 
0.9999 
E-09 
£-21 
Ml 

0.9999 
0.9999 
E-06 
£-07 
£-09 

1.0000 
0.999V) 
E-05 
E-12 
£-21 



- 0..V) 
0.0252 

0.2255 

0.025 

0.0002 

<E-20 

<E*20 

<E-20 

E-18 

<£.20 

£-10 
E-10 
£.14 
E-15 
E-17 

0.0053 
0.0976 
0.0617 
0.0372 
0.003d 

£-05 
0.00 11 
0.001 1 
E-05 
E-07 

= 0.99 

E-05 

0.0107 

0.0107 

£4)6 

0 

0 
0 
0 
0 
0 

0 
0 
0 
0 
0 

E-03 
E-06 
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E-13 
£-10 
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E-05 
E-05 
£-11 
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SPl 
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PI 

P2 
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D2P2 
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P2 

D3P2 

P2 
Pi 

D1P2 

DP1P2 

D2P2 
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P2 
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l.OOiJfJ 
0.9957 
0,00 13 
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E-06 

1.0000 

0-9635 

0.0365 

E-15 

E-16 

1.0000 

1.0000 

E-09 

E-14 

E-17 

1.0000 

1.0000 

E-06 

E-07 

E-10 

1.0000 

1,0000 

E-li 

£.17 

E-I9 



0.9999 

0-9999 

E-ll 

E-14 

£-20 

1.0000 

0.6135 

0.3364 

E-48 

E49 

0.9999 

0.9999 

E-22 

E-49 

E-54 

1.0000 

0.9947 

0.0052 

E-IS 

E-25 

1.0000 

1.0000 

E-24 

E-44 

E-50 



E-07 

O.00j3 

0.0033 

E-06 

E-07 

<E-20 
0.052S 
0.052S 
<E-20 
<E-20 

<E-20 
<E-20 
<E-20 
<E-20 
M7 

E-03 
E-06 
E-06 
E-13 
E-16 

E-09 
£4)9 
E-ll 
E-17 
E-13 



E-03 
E438 
£-11 
£-14 
E-21 

0 

0.-1446 
0.4446 
0 
0 

0 
0 
0 
0 
0 

EJ)9 
E-09 
E-ll 
E-13 
E-25 

E4>9 
E-09 
E-24 
£44 
E-49 



HvbcL hvbrid: InhcL Unbred: Prob.. prob;ibilic% : SE. standard error, referring to the variability m [he results 
oF the runs: PI. parent one: P2. parent cw SPl SP2. full *ihlmj; of parent one paa-nc ^o: D/Pl/D/P 1 ,, 
derivative of piirrm one. pnrem two, index / tor diMina inbretl lines; DPIP2, deriuitive^ of huth parent one 
and piirriit tun. 
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Probability of ancestry (lag l0 ) 
AAA D-PI/P2 XXX 0*crs 3 ? C> Parcm 



.r S-PI/P2 



Flt;L ; k , i -u , Probabilities ofwccMrv. wumiw P - 0.30. for all 54 hvbrid* unci top ranking inhr«li-chose with probability 
of i«Siy ac U-nl l! > ' S Probities of oncac* fuming r> = <M» br all 54 hybncU unci cop rank,,* mbrccb-ih^ 
with pritbiibilii\ ot ancotrv :u lcu.it 10 *\ 
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FicunK \.-—0)ftiifiiit?d- 



When the algorithm used P = 0.50. the two correct 
parents were identified us highest in probability for 48 
(8 or M hvhrids (Figure I). For each of hybrids 

:^R:>2. SOU. and XlHUfvU. one parent 

ranked in die top two places. The other patent was 
supplanted either b\ a si^u-r inbred in- hvun inbred that 



was a direct progeny of that parent. Overall. H)|£ i!)4 e fcl 
of 108 parental mrireds were correctly identified, For 
hybrids where both parents ranked first or second, the 
range of probabilities for parental lines that ranked first 
from among all other inbred* ranged from UinuU u> 
0.9997: parental lines ranking second ranu'rd IVom 
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I 0000 uiO,*M»r»:V F«»r 35 hybrids, both parent had p"jb- 
abililio ol an^uy in excess ofO.tiW. PtvbahiliiieN <>l 
ancestry t'oi- iu <n parents that ranked in hr<t or s-eumd 
places were limn O.yyuy to 0.7034, For the iiia|urii>. cit 
hybrids, the probability of the third and hi-hoi ranked 
notiparcnial inbred was at or below E-Oti. This indicates 
that theft is usually very little uncertainty about eldest 
ancestors. 

When the algorithm used /> - 0.99 to examine each 
of the 54 hvbrids. both parents were correctly identified 
for 52 m% > of hybrids and for 98ft (102* 104) of the 
parents acrott all hybrids (Figure I). Two hybrids (3DU 
and X0915A), in which one parent was not ranked in 

the top two, w « fe in ch * 001 rankcd in lhe 

top two assuming P 0.50 (above). In both rases their 
ranks improved (both to third rank) and the actual 
parent was supplanted by an Inbred that was a direct 
progeny of the corresponding parental line. For 49 hy- 
brids, both parents had probabilities oF ancestry in ex- 
cess oF 0.999. Among the 5 hybrids having a parent 
ranking scewnd with a probability oF ancestry below 
0.999, the lowest oF these probabilities was 0-3976 and 
the highest probability for a third ranking nonparent 
was 0.1023. For most hybrids the probability for the 
third and highest ranked nonparental inbred was at or 
below E-10. 

Table 2 also addresses data analysis in circumstances 
where heterozygous loci occur in inbred lines or where 
a hybrid is scored for the presence of more than two 
alleles per locus. The presence of more than a single 
allele per locus in inbred lines is an infrequent occur- 
rence in well-maintained inbred development and seed 
increase programs but is possible because *v3-5$ of 
loci can still be segregating and unintended pollination 
from genotypes not designated as parents of the hybrid 
can occur. For hybrids, more than two alleles per locus 
can be scored when DNA is extracted from a bulk of 
individual plants and because inbred parents are not 

homo^-gous due either to residual heterozygosity or to 
contamination or because one or more direct parents 
of the hybrid are themselves hybrids. The presence of 
more than one allele per locus in an inbred line and 
more than two alleles per locus in a hybrid therefore 
can be accommodated by multiple runs of the algo- 
rithm, each with a random choice of two alleles per 
locus. Consequently, standard errors in the case of ana- 
lysing data from 193 loci tend to be very small because 
there were Few loci where an inbred or hybrid sample 
(from a bulk of individual planes) was scored for more 
than two alleles. 

Marshall rtd/. (lfcW) have drown attention to errors 
thai can be encountered in genutyping surveys. These 
errors include mining data, null alleles, and typing er- 
rors. We therefore inve-tiu'^rd the robustness of the 
algorithm hv examining the effect* of modifications in 
thl- data tW live hvhiicU '3417. 3325. 353n\ 5VMifi. ami 



#140). First, we reducvd the number of used. From 
the full >vt of IW to loo and then to 30 (Table 2j. Use 
of 50 loci generated imonvcl ranking of one parent 
fur caeh of two hybrid VM\~ and 3U40> and for both 
parents of one hybrid 'Worn, All of these most highly 
ranked nonparental inbreds were closely related to the 
true parenL* for each of the respective hybrids: six differ- 
ent inbred lines were involved, four were direct progeny 
oF the true parents (one with additional backcrosse* 
From the true parent) and two were full sisters (from a 
cross of highly related inbreds) oF the actual parent of 
the hvbrid. Using 100 loci resulted in correct parental 
rankings for all hybrids except for 3905 where neither 
parent ranked in first or second place- Four inbreds 
outranked the true parents of 3905. All Four nonparents 
were closely related to the respective true parents; three 
were direct progeny of the true parent of the hybrid 
(one with additional backcrossing to that parent) and 
one was a full sister of the true parent. Use of data from 
all 195 loci corrected the placement for one of die 
parents of hybrid 3905. Two inbreds that were not par- 
ents of this hybrid remained ranked more highly than 
one of the true parents. Both were direct progeny of 
that parent, and one of these inbreds had additional 
backcrossing to that parent in its pedigree. 

To address the consequences of laboratory and other 
sources of error, we artificially compromised data qual- 
ity beyond the level originally provided by eliminating 
specific proportions of alleles that had been scored (es- 
tablishing scenarios where various numbers of SSR al- 
leles were not scored) and by misscoring other alleles 
(establishing scenarios where various numbers of SSR 
alleles were scored incorrectly'). We also combined the 
scenarios of missing data and wrongly scored data. Table 
3 contains a summary of the results of making these 
modifications in the data. For all modifications we used 
data from all SSR loci and we also randomly chose SSR. 
loci to create subsets of 50 and 100 loci. In each case, 
the program was am 20 times for each hybrid/set of 
luCi. When all 195 loci were examined, replications dif 
fered only according to die particular choice of alleles 
for loci where more than two alleles had been scored. 

To evaluate robustness in the face of missing data or 
mistyped daca r we simulated individual and combined 
categories of these data in the hybrid and all inbred 
lines at levels of 2, 3, 10, and 23<5 of the alleles for each 
of five hvbrids and all inbreds bevond che level of error 
as originally scored by the laboratory. We examined the 
effects of these levels and apes of error for three sizes 
of database: 50 loci, 100 loci, and all 195 scored loci. 
The same five hvbrids considered in Table 2 were investi- 
gated: MIT, 3525. 3556. 3903. and 3940. One of these 
hybrids (3905) wis chosen because one of its parents 
did not rank among the top two places even when the 
Complete and unmodified dara from all SSR loci were 
usud. 

txamples of robustness in the face of additional error 
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|m \\\r hvbrid> li.sini» sUPm ts of :V> |tK» loci .nut .ill 
ion are shown in Table 3 when* mimlxns ol puteni> 
ranking into the tap two places Ate presented. Degrada- 
tion in the preferential ranking ol parent inbreds at a 
k-vi'l of2V> additional mUsiu^ rl;ii;i was shown for on** 
hybrid (33 fc J5) with usage of 50. 100, or all SSR loci- 
Degradation in the preferential ranking of parent in- 
bred* ii level of 2')% additional mkscored data was 
shown for hvbrid 3556. When both -additional levels of 
milling and misscored data were simulated, degradation 
in die ability to preferentially rank inbred parents oc- 
curred For all hybrids and for all sets of SSR (50. 100. 
and 195 loci) except for hybrid 3417 when data from 
193 SSR loci were used. Over all five hybrids, use of 100 
loci improved robustness from the use of 50 loci; use 
of 195 loci further improved robustness for four hybrids 
(34l7 f 3525. 3905, and 3940). The degree of improve- 
ment was small, except for hybrid 3905. 

We also ranked inbreds according to their probability 
of ancestry of hybrid* when both parent* and all inbred 
derivatives and full-sister inbreds of the respective in- 
bred parents for each hvbrid were excluded from the 
analysis. The results arc too voluminous to present here 
but can be summarized as follows: Using P = 0-50, a 
grandparent of each respective hybrid ranked into first 
place for 41 (76%) hybrids; probabilities ranged from 
0.4976 to 1.0 and most were above 0.9999. Other classes 
of inbreds that ranked in first position for probability 
of ancestry were inbreds derived direcdy by pedigree 
from a grandparent of the respective hybrid (DGP) for 
13% of hybrids, inbreds derived direcdy by pedigree 
from a greac-grandparent of the respective hybrid 
(DGGP) for 9% of hybrids, and one class {2% of hy- 
brids) with an inbred ranked into first place that was 
direcdy related by pedigree to the great-great-grandpar- 
ent of that hybrid. Inbreds that ranked in second posi- 
tion were related to the respective parents oF the hybrid 
as follows Thirty-one {57% of hybrids) were a grandpar- 
ent of the respective hybrid, 11 (20%) were classed as 
DGP, 7 (13%) were DGCP, 1 (2%) was class DGGGP, 
and 4 (7%) were a great-grandparent (CGP) of the 
respective hybrid. Over all hybrids, two of the four 
grandparents ranked into first and second posldons for 
23 (43% of hybrids); three grandparents ranked into 
the first three positions for 5 (? r r- of hybrids). There 
were no instances where all four grandparents ranked 
into the first four positions. Thh rv hybrids had a grand- 
parent ranked into first position using P » 0.99. The 
number of grandparents ranked into the top five posi- 
tions was 93 (compared to IDS when P = 0.50). The 
number of grandparent* rartkiny: into thr top two posi- 
tions was f>3 (compared to 71 when P = 0.30V The 
mean probability of a grandparent dun ranked into die 
first iwo positions was ft.sVZSS iSP ■= 0.1 when P *» 
0.30 and O.iWU (SI) = O.WlH) when P = O.W. 



DISCISSION 



Thr prevalent use of paternity indices demonstrates 
that it U advantageous to have explicit probabilities of 
ancestn to distinguish among di fle rent pedigrees. Mo- 
leuiLu' marker profile* are rapidly becoming m"K ex- 
tensive and rout effective to generate. Features that would 
advance the statistical analysis of molecular marker data 
to provide explicit probabilities of ancestry include the 
ability to calculate probabilities of ancestry where there 
is no a prion information as to the identity of one f usually 
the maternal) parent and robustness in the face of labo- 
ratory error. 

Maize inbred lines and hybrids provide a very exacting 
set of materials for evaluating the discriminatory abilities 
of molecular data and statistical procedures that are 
employed to interpret those data, Hundreds of maize 
inbred lines of known pedigree together encompass a 
great diversity and complexity of pedigree relationships. 
Some Inbred linen can be very highly related and geneti- 
cally simitar due to their derivation from common par* 
cntage including from parents that are themselves highly 
related. Consequently, relationship categories such as 
"sister" or "parent" when applied to maize inbreds usu- 
ally refer to closer degrees of pedigree relationship and, 
thus* of germplasm and molecular marker profile simi- 
larity than those of the equivalent^' named classes of 
relationship for animal species. Most maize hybrids that 
are widely used in the United States today are con- 
structed from pairs of inbred lines that are unrelated 
by pedigree, each inbred parent having been bred from 
a separate "poor of gcrmplasm. Various degrees of relat- 
edness are possible becween hybrids according to the 
pedigree relationships among their constituent inbred 
patents. 

Using P = 0.99 in the algorithm is more specific for 
identifying parents than using P = 0-30- However, P = 
0.99 is less robust for identifying other relatives, such 
as grandparents. When the algorithm was run at P = 
0.50 there were 6 hybrids for which one parent did not 
rank among the top two most probable genotypes. For 
the remaining 43 hybrids the correct parents were iden- 
tified even in circumstances where other candidate in- 
breds Included not only full-sister lines bred from re- 
lated parents but also inbreds even more closely related 
to the true parent by virtue of being backcross conver- 
sions of the inbred parent of the hybrid. For each of 
die 6 hybrids where a nonparent ranked above a true 
parent, that higher ranked inbred was always either a 
sister or progeny of the outranked taie parent. The 
range of pedigree relationships as expressed by die 
Malecoi coefficient of relatedness (Mau?.cot 1948) that 
was encompassed hy pairs of true parents and more 
highly ranker I inbred relatives of the true parents w-a* 
from 0.H:W n< o «Ttf*0. A coefficient of tVS;i»M'i .ippmxi- 
nuues a rt'htiomhip between inbred A and A' where 
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inbred A' ims been bred I'mm a cms> of inbreds A and 
B with between one and tuu additional li,teki.Tov»cs "I 
the parental inbrucl A. A Malecot coefficient of relation* 
ship of 0.%rtf) closvh approximate* a relationship be- 
tween inbreds A and A" where four additional back- 
crosses of parental inbred A fallow the Initial cross of 
inbreds A and B. 

Running chc algorithm at P = 0.99 in comparison to 
p- 0.50 raises the probability of ancestry for the parents 
while diminishing the probabilities for the third and 
lower ranking candidate inbred lines. Use of the algo- 
rithm aE P - 0-99 increased both the percentage of 
hvbricls with both parents ranked in the first two posi- 
tions (from 39 to 96%) and the percentage of parental 
inhreds that were ranked first and second (from 94 to 
93%), Two hybrids (3914 and X0915A) did not have 
both parents ranked first and second when the algo- 
rithm was run at P = 0.99. For both of these hybrids 
the nonparental inbred that outranked the trite parent 
was itself a product by pedigree from the true parent 
that had been created by an additional four backcrosscs 
of that parent; the Malecot coefficient of relationship 
between the parent of the hybrid and the inbred that 
outranked that parent for these two hybrids was 0.96S6. 

Robustness was tested by evaluating the effects of us- 
ing data from different numbers of loci and by simulat- 
ing additional levels of missing and m Scored data up 
to combined levels of *2d% error beyond that which was 
provided by the laboratory. From our experience, error 
rates of 5 to 10% can occur in S5R profiling of maize 
due chiefly to the combined effects of residual heterozy- 
gosity among seed lots and by deficiencies in the scoring 
of hecerozygotes in hybrids. The additional levels of 
simulated error, therefore, include values (up to ^35^ 
total error) that are well outside of our experience. For 
five hybrids that were examined, increasing the number 
of loci from 30 to 100 (with no additional missing or 
misscored data) did reduce the number of instances 
where inbreds that were not parents of a hybrid out- 
ranked the true parent from four to one. Nonetheless, 
a!! of these inure highly ranked inbreds, although they 
were not themselves the true parents of the respective 
hybrid, were either direct progeny or full sisters of the 
true parent (Table 2), Consequently, if such degrees of 
error can be tolerated in respect of pedigrees for inbreds 
that are identified as parents of hybrids, then SSR data 
from 50 loci of equivalent discrimination ability are 
sufficient. Use of data from 50 loci also evidenced ro- 
bustness in the face of up to i0 r c additional levels of 
either missing or misscored data: no degradation in the 
ability to identify a parent was apparent up to the level of 
10% additional error except for 10 c c additional missing 
and misscored alleles, for one hvbrid t:W25: Table 3). 
However, use of 100 loci increased the proportion of 
true parent* that were correctlv trtewi fieri from 531?. 
(fur 5n Kich to 7W lincan correct piuvtits over all 



le\eK oi error: Table Use of dam from 11*5 loci 
po Aided ijiwui.': resiliency ug.iinat additional levels i;f 
error. However, use of data from H>5 loci was unable to 
provide resiliency against the negative effects of adding 
combined levels fui *25 4 S» of both missing unci misscored 
data (Table 3;. At the 'JS't level of additional poor data 
integrity, inbreds that were not related to the true parent 
of the hybrid outranked the true parent for Jour of die 
five hybrids. Levels of missing or misscored data should, 
therefore, be kept below 15-20*5 (assuming a level of 
5-10% error in the data we analyzed prior to simulating 
additional error). 

We have previously examined the pedigrees of in- 
breds that are ranked into the first two positions when 
the true parents are removed from the list of candidate 
inbred lines. Usually, direct progeny or full sisters of 
the true parents then rank most highly (data not pre- 
sented). We therefore examined the rankings of inbreds 
with respect to their ranking and probability of inclusion 
in the ancestry of each hybrid after the removal, not 
only of the true parents, but aUo of the progeny of the 
true parents and any full sisters of the true parents. In 
these circumstances the grandparents of the hybrids are 
ranked predominantly into top positions. Using P - 
0.50. a grandparent ranked Into first position for 76% 
hybrids and into second position for 57% hybrids; with 
P - 0.99 a grandparent ranked into first place in 56% 
of hybrids. At P » 0-50 two grandparents ranked into 
first and second positions for 43% hybrids and into the 
first three positions for an additional 9% hybrids. Most 
of the remaining inbreds that ranked into the top two 
positions were progeny of the grandparent A total of 
103 grandparents ranked into the top five positions 
when P - 0.50; 93 ranked into these positions when P - 
0,99. Seventy-one grandparents ranked into the top two 
positions when P = 0.50; 55 grandparents ranked into 
these positions when P = 0.99, The mean probability 
of a grandparent in the top two positions was 0.9288 
(SD 0.1454) when P = 0.50 and 0.9980 (SD 0.0104) 
when P = 0-99. Our algorithm was written to identify 
pairs of ancestois; alternative algorithms could be tai- 
lored to identify all grandparents once parents had been 
identified and removed from the list of candidate in- 
breds. 

We have demonstrated the capability and robustness 
of an algorithm that can be used to show probability of 
parentage in circumstances where the a prion pedigree 
identic)' of neither parent is known. Exclusions arc taken 
into account, thereby allowing parentage to be shown 
even when the two parents are not represented in the 
database of molecular profiles thai are examined. Het- 
erozygous candidate parents can be accommodated. 
The number of loci that is necessary to provide a reliable 
basis of determining pedigree is dependent upon the 
degree of relatedncss amonsf parents and nonpatents 
and upon the diK'rimituuotv ability of die marker sy^ivm 
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in the sprcirs of MUcffsc. l*sm«J /* = (UH) compared \o 
P = CJ.r»0 prclVi eiui.dK itk-iuiliwd tin TV true LntieUL* ;tml 
with a greater difference of probability to third pUiced 
noiiparuuLs. If thvre U rtMsututhle assurance ch;U the 
parents are among the candidate list of Inbred*, then 
p = D.'jy shuulcl be used; if greater robustness \$ re- 
quired, then P = 0.3O dhould be used. 

Application* of our algorithm include die identifica- 
tion of pedigrees among individuals of plunc or animal 
species where molecular profile daosets exist that can 
be interpreted in terms of segregating alleles at Individ- 
ual marker tod and that provide a sufficient power of 
discrimination. Capabilities to generate large datasets 
of suitable molecular profile data are already available 
and are increasing rapidly with the advent of single 
nucleotide polymorphisms. One further application of 
our algorithm is to assist in the protection of intellectual 
property that is obtained on plant varieties or upon 
specific dams or sires of animals through the determina- 
tion Of pedigrees. 
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Attorney Docket No. 1364 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant: Mark David Hoffbeck Date: March 1 1 , 2003 

Serial No.: 09/758,790 Group Art Unit: 1638 

Filed: January 11, 2001 Examiner: David T. Fox 

For: "INBRED MAIZE LINE PH5WB" 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION 
OF 

DR. DINAKAR BHATTRAMAKKI 

Sir: 

I, Dinakar Bhattramakki, Ph.D., do hereby declare and say as follows: 

1 . | am skilled in the art of the field of the invention. I have a Ph.D. in Piant 
Molecular Genetics from the University of Illinois at Urbana-Champaign. I have a 
Bachelor of Science degree in Agricultural Sciences from the University of 
Agricultural Sciences, Bangalore, India. Since 1997 1 have been engaged in the 
analysis of molecular markers for plants. I have supervised the Molecular Marker 
Applications lab at Pioneer Hi-Bred International, Inc. from January 2002 until the 
present. 

2. I am familiar with the methods used in the analysis of Simple Sequence 
Repeat, SSR, marker data for inbred PH5WB conducted at Pioneer Hi-Bred 
International. Inc. The analysis of the SSR profile of inbred PH5WB may be 
accomplished without any undue experimentation. The SSR profile for inbred 
PH5WB is attached hereto. 

3. Means of performing this genetic marker profile are well known in the art. 
SSRs are genetic markers based on polymorphisms in nucleotide sequences. The 
PCR™ detection of SSRs is accomplished by using two oligonucleotide primers 
flanking the polymorphic segment of DNA. Amplification is accomplished through 
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repeated cycles of heat denaturation of the DNA followed by annealing of the primers 
to their complementary sequences at low temperatures, and extension of the 
annealed primers with DNA polymerase. 

4. Markers are scored following amplification and gel electrophoresis of the 
amplification products. Scoring of marker genotype is based on the size or weight of 
the amplified fragment. While variation in the primer used or in laboratory 
procedures can affect the reported marker score, relative values remain constant 
regardless of the specific primer or laboratory used. 

5. Primers that may be used to identify the SSR markers reported herein are 
publicly available and may be found in the Maize DB on the World Wide Web at 
agron.missouri.edu/maps.html (sponsored by the University of Missouri), in 
Sharopova et al. (Plant Mol. Biol. 48(5-6):463-481) and/or in Lee et al (Plant Mol. Biol. 
48(5-6); 453-461). Markers shown for PH5WB are the publicly available markers in 
the sources listed above for which PH5WB was tested and shown to be homozygous. 

6. Map information is provided by bin number as reported in the Maize DB. The 
bin number digits to the left of decimal point typically represent the chromosome on 
which such marker is located, and the digits to the right of the decimal typically 
represent the location on such chromosome. 

7. I hereby declare that all statements made herein of my own knowledge are 
true and that all statements made on information and belief are believed to be true; 
and further that these statements were made with the knowledge that willful false 
statements and the like are punishable by fine or imprisonment, or both, under 
Section 1001 of Title 18 of the United States Code and that such willful false 
statements may jeopardize the validity of the application or any patent issued 
thereon. _ 
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Essentially Derived variety s 



What is an "E ssentiall y Derived Vari e ty "? 

The concept of essentially derived variety was Introduced into the 1991 Act of the UPOV 
Convention in order to avoid plagiarism through mutation, multiple back-crossing and to 
fill the gap between Ptantjreederjs Right? and patents, gap which was becoming 
important due to the development of the use of patented genetic traits in genetic 
engineering. 

An essentially derived variety is a variety which is distinct and predominantly derived 
from a protected initial variety, while retaining the essential characteristics of that initial 
variety. 

As indicated as an example in the UPOV Convention, essentially derived varieties may 
be obtained by the selection of a natural or induced mutant, or of a somaclonal vanant. 
the selection of a variant individual from plants of the initial variety, back-crossing, or 
transformation by genetic engineering. 

The commercialization of an essentially derived variety needs the authorization of the 
owner of the rights vested in the initial variety. 

The concept of essentially derived variety does not at all abolish the Breeder's 
Exemption as free access to protected plant varieties for breeding purposes is 
maintained. It is not a threat to biodiversity. On the contrary, it favors biodiversity, 
encouraging breeders developing and marketing original varieties. 
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OTOV Publication No. 644(E) , Section 1 



INTERNATIONAL CONVENTION 
FOR THE 

PROTECTION Of NEW VARIETIES OF PLANTS 

of December 2, 1961, as revised 
at Geneva on November 10, 1972, 
On October 23 f 1978, and 
on March 19, 1991 



adopted by the Diplomatic Conference 
on March 19/ 1991 

reproduced from OPOV Publication No, 438(E) 
issue No* 63 of "Plant Variety Protection" 



1991 Act of the Convention 
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Article 13 
Examination the application 

. ; Any decision to grant a breeder's eight shall require an examination for" 
compliance with the conditions under Articles 5 to 9. 'In the T eourae of the : 
examination, the authority any grow the variety or carry out other necessary' 
ftta, cause the growing of the variety oe the carrying out of other necessary 
teat*/ or take into account the results of growing tests or othar trials which 
have already seen carried out. For the purposes of examination, the authority' 
nay require tha broader bo furnish all the necessary information/ documents or 
material* 



Article 13 
Provisional Protection 

Each Contracting Party shall provide measures designed to safeguard the 
Interests of the breeder during the period between the filing or the public*- 
) tion of the application for the grant of a breeder's right and the grant of 
that .right. Such measures shall have the effect that the holder o£ a breeder's 
right shall at least be entitled to equitable remuneration from any person who, 
during the said period* has care lad out acta which* once the right is granted, 
require the breeder's authorization as provided in Article 14. A Contracting 
Party say provide that ths said neaeures shall only take effect in relation to 
persons whoa the breeder has notified of the filing of the application. 



G9AP9BU V 
THE RIGBfS OF IBB BRKSDgR 

Article 14 
Scope of the Breeder's Right 

(1) [ Acts In respect of the propagating material 3 (a) Subject to Articles 15 
and 16 1 the following acts in respect of the propagating material of the pro- 
tected variety shall require the authorization of the breeders 
(i) production or reproduction (multiplication), 
(11] conditioning fox the purpose of propagation, 
(ill) offering for sale, 
(iv) selling or other marketing, 
(y) exporting t 
<vi) importing, -_ • 

(vli) stocking for any of the purposes mentioned in (1) to (vl). above. 

(b) .The breeder may sake his authorization subject to conditions and 
imitations.' . / 

(3) [ Acts In respect of the harvested material ] Subject to Articles 15 
aod,16^. the acta referred to in items (i) to (vli) of paragraph [I) (a) in 
reepecj of harvested material* including entire plants and parts of plants* 
obtained " through the unauthorised use of propagating material of the protected 
variety shall' require the authorisation of the breeder/ unless the breeder has 
had" reasonable opportunity to exercise his right in relation to the said 
propagating material. 



1991 Act of the convention Page 7 
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vaileiy.^ 1 " 16 ' p " ductiw «• "peated „. B of the protected 
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(lii) except for the differences which result froa tha »«t „f 

variety """^ ^ 9W9tnt 0r «»»i«tlo» of g«ot£V V£ inttial 
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tion of a natural oc induced mutant, or of a acaelonal variant IL !!t!!:?L 
of a variant individual froa plants of the inTtial i, b ae - tec ««» 

transformation by gsnetlo engineering. T,,li « t,, back «°" i »9' « 

Article IS 
■xceptim a to the Breeder 'a night 

<D CCowoulsory except, m„ B ] She breeder's right shall not extend to 
CD acts done privately and for ncn-cowwrcial purposes, 
<li) acts done for experimental purposes and 

•hi]- ) .i,r!! TT tBe pUtp0ae oe b *«' J la9 othtr varieties, and, except 
whare the provisions of Article 14(5) apply, aets referred to in Article Mil) 
to (4) in respect of such other varieties. *4UJ 

<*> tpptional exception] HOtWthstanding Article 14, each Contracting Party 
wStrof tM the M b7 , i lBlM to th « ^guarding ^th. "St^iJ ^ 

V«uS L «d.^ J Ztll 1^ iCt "* b "» 1 «'- rt*-t 1« relation to -any 
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1991 Act of the Convention page a 



Received from < 515 334 6883 > at 3/11/03 4:07:36 PM [Eastern Standard Time] 



03/11/03 TUE 16:37 FAX 515 37T6S83 



PIONEER HI -BRED DSM 



Attorney Docket No. 1364 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

Applicant: Mark David Hoffbeck Date: Dec 1 0, 2002 

Serial No.: 09/758,790 Group Art Unit: 1638 

Filed: January 1 1 , 2001 Examiner: David T. Fox 

For: "INBRED MAIZE LINE PH5WB" 

Assistant Commissioner for Patents 
Washington, D.C. 20231 

RULE 132 DECLARATION 
OF 

DR. STEPHEN SMITH 

Sir: 

I, Stephen Smith, PhD., do hereby declare and say as follows: 

1 . I am skilled in the art of the field of the Invention. I have a Ph.D. in Biochemical 
Systematics and Taxonomy of Maize and its Wild Relatives from Birmingham University. 
I have a M.Sc. in the Conservation and Utilization of Plant Genetic Resources from 
Birmingham University. I have a Bachelor of Science degree In Plant Sciences from 
London University. Since 1977 I have been engaged in the development, study and 
application of molecular markers to genetics, measuring genetic diversity and tracking 
pedigrees. I commenced this work at North Carolina State University as a post-doctoral 
research fellow. I have continued my engagement in these studies during my 
employment by Pioneer Hi-Bred from 1980 until the present. These studies have 
resulted in numerous scientific articles that have appeared in peer reviewed scientific 
literature. 

2. I have read and understood the Office Action in the above case dated October 
11 r 2002. This declaration is in response to the Examiner's rejection under. 35 U,S,C, § 
102(e) as anticipated by or, in the alternative, under 35 U.S.C. § 103(a) as obvious over 
F361 as described in Dowden (U.S. Patent No. 5,880,339). 

3. I have conducted an analysis of SSR marker data for inbred PH5WB and the 
inbred cited as prior art, F361. Out of a total of 77 SSR loci examined, which allowed a 
sampling of each chromosome, there are 45 markers that show differences between 
PH5WB and F361 . This represents a difference for 58% for the markers tested. Of 
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these 45 markers, 19 were greater than 50 cM in distance, or unlinked on the genetic 
map. 

4. Upon crossing PH5WB to any other maize line and selfing successive filial 
generations, one would within the realm of what is statistically possible, obtain a progeny 
Inbred maize line that retains genetic contribution from PH5WB. Assuming that (i) the 
cited prior art is used as the maize line to which PH5WB is crossed, (ii) that the only 
difference between PH5WB and F361 are these 45 markers, and (ill) that all markers 
within a 50 cM distance will segregate together, then the odds of obtaining a PH5WB 
progeny inbred that is the same as F361 after one cycle of breeding, is 1 in 2 19 or 1 in 
524,288. Statistically it is extremely unlikely that a PH5WB progeny, after one cycle of 
breeding, would be the same as F361 . 

5. Further, the assumptions made above vastly overstate the likelihood of breeding 
PH5WB from F361 . For example, it Is common practice in quantitative genetics to 
determine the relation of plants by differences in markers. In doing so, one extrapolates 
that a percentage difference in markers is indicative of a difference in the whole genome. 
To assume that the only differences between PH5WB and F361 are for these 45 
markers, when 45 markers constitute 58% of the 77 SSR loci examined, is a gross and 
unrealistic assumption. Further the current maize genetic map only has approximately 
sixty 50cM units, so by applying this limitation the maximum number of independently 
segregating loci one could obtain, using the most different maize lines that could ever be 
found, is sixty. These assumptions result in an over estimate of the odds of breeding 
PHSWBfrom F361. 

6. Given the difference in molecular markers between PH5WB and F361 , it is my 
expert opinion that PH5WB and F361 are very distinct Inventions. It Is also my expert 
opinion that, within the realm of what is statistically possible, any progeny of PH5WB 
developed through crossing PH5WB with another plant will be distinct from F361. Given 
the facts and based on my education and scientific experience, I believe that the 
invention as claimed is not obvious nor anticipated by F361 cited in Dowden (U.S. 
Patent No. 5,880,339). 

7. I hereby declare that all statements made herein of my own knowledge are true 
and that all statements made on information and belief are believed to be true; and 
further that these statements were made with the knowledge that willful false statements 
and the like are punishable by fine or imprisonment, or both, under Section 1001 of Title 
18 of the United States Code and that such willful false statements may jeopardize the 
validity of the application or any patent issued thereon. 
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Date : Orr Tffls Bv: >fl^ A ^^T~ 

Stephen Smith 
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